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MULTIGENE EXPRESSION VECTORS FOR THE BIOSYNTHESIS OF PRODUCTS 
VIA MELTIENZYME BIOLOGICAL PATHWAYS 



This application is based on United States Provisional Application No. 60/123,015, filed 
March 5, 1999. 

FIELD OF THE INVENTION 

The invention relates to the construction and use of multigene expression vectors useful 
to enhance production of materials by muhiensvrae pathways. In particular, the construction and 
use of multigene vectors encoding proteins in the polyhydroxyalkanoate biosynthetic pathway is 
disclosed. 

BACKGROUND OF THE INVENTION 

Metabolic engineering is a process by which the normal metabolism of an organism is 
altered to change the concentration of normal metabolites, or to create novel metabolites. This 
process often involves introduction or alteration of numerous enzymatic steps, and thus often 
requires introduction of multiple genes. An efficient system for introducing and expressing 
multiple genes is therefore desirable. In prokaryotes such as Escherichia coli, introduction of 
multiple genes is relatively straightforward in that operons can be constructed to express multiple 
open reading frames, or multiple complete genes can be expressed from a single piasmid. 
However, introduction of pathways into plants is more difficult due in part to the complexity of 
plant genes, the difficulty of constructing vectors harboring multiple genes for expression in 
plants, and the difficulty of introducing large vectors intact into plants, 

Polyhydroxyalkanoaies are bacterial polyesters that accumulate in a wide variety of 
bacteria. These polymers have properties ranging from stiff and brittle plastics to rubber-hike 
materials, and are biodegradable. Because of these properties, polyhydroxyalkanoaies are an 
attractive source of non-polluting plastics and elastomers. 

Currently, there are approximately a dozen biodegradable plastics in commercial use that 
possess properties suitable for producing a number of specialty and commodity products 
(Lindsay, Modem Plastics 2: 62, 1992). One such biodegradable plastic in the 
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poiyhydroxyalkanoate (PHA) family that is commercially important is Biopol™, a random 
copolymer of 3~hydroxybutyrate (3HB) and 3-hydroxyvalerate (3HV). This biopi&stic is used to 
produce biodegradable molded material (e.g., bottles), films, coatings, and in drug release 
applications. Biopol™ is produced via a fermentation process employing she bacterium 
Ralstonia eutropha (Byroni, D. Trends BiotechnoL 5: 246-250, 1987). ( R. eutropha was 
formerly designated Aicaligenes eutrophm [Yabuuehi et al, Microbiol. Immunol. 39:897-904, 
1995]). The current market price is $6-?/lb, and the annual production is 3,000 tons. By best 
estimates, this price can be reduced only about 2-fold via fermentation (Poirier, Y. et a!.. 
mo/Technology 13: 142, 1995). Competitive synthetic plastics such as polypropylene and 
polyethylene cost about 3S-45*/lb (Layman, Chem. <fi Eng. News, p. 10 (Oct. 31, 1994). The 
annual global demand for polyethylene alone is about 37 million metric tons (Poirier, Y. et ah. 
Int. J. Biol Mammal. 17; 7-12, 1995). It is therefore likely that the cost of producing P(3HB- 
C0-3HV) by microbial fermentation will restrict its use to low-volume specialty applications. 

Polyhydroxyaikanoate (PHA) is a family of polymers composed primarily of R-3- 
hydroxyalkanoie acids (Anderson, A. J. and Dawes, E. A. Microbiol. Rev. 54: 450-472, 1990; 
Steinbtiehel, A, in Novel Biomaierials from Biological Sources, ed. Byroni, D. (MaeMiiian, New 
York), pp. 123-213, 1991); Poirier, Y, Nawrath, C. & Somervilie, C. Biotechnology 13: 143- 
150, 3995). Polyhydroxybutyrate (PHB) is the most well-characterized PHA. High molecular 
weight PHB is found as intracellular inclusions in a wide variety of bacteria (Steinbnchel, A. in 
Novel Biommeriak from Biological Sources, ed. Byrom, D. (MaeMiiian, New York), pp. 123- 
213, 1991). in Ralstonia eutropto, PHB typically accumulates to 80% dry weight with 
inclusions being typically 0.2-1 urn in diameter. Small quantity of PHB oligomers of 
approximately 150 monomer units are also found associated with membranes of bacteria and 
eukaryotes, where they form channels permeable to calcium (Reusch, R. N„ Can J. Microbiol. 
41 (Suppl 1): 50-54, 1995). High molecular weight polyhydroxyalkanoates have the properties 
of tenoplasties and elastomers. Numerous bacteria and fungi can hydroiyze 
polyhydioxyalkanoates to monomers and oligomers, which are metabolized as a carbon source. 
Polyhydroxyaikanoates have accordingly attracted attention as a potential source of renewable 
and biodegradable plastics and elastomers. PHB is a highly crystalline polymer with rather poor 
physical properties, being relatively stiff and brittle (de Komng, (}., Can. J. Microbiol 41 
(Suppl. 1): 303-309, 1995), in contrast, PHA copolymers containing monomer units ranging 
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from 3 to 5 carbons for short-chain-length PHA (SCL-PHA), or 6 to 14 carbons for medium- 
chain-length PHA (MCI. .-PHA), are less crystalline and more flexible polymers (de Koning, Q., 
Can. J. Microbiol 41 (SuppL 1): 303-309, 1995). 

PHB has been produced in the plant Arabidopsis thaliana expressing the R, eutropha 
PHB biosynmetic enzymes {Poirier, Y. et al. Science 256: 520-523, 1992; Nawrath, C, et a!., 
Proc. Natl. Acad, Sci U.S.A. 91: 12760-12764. 1994). In plants expressing the PHB pathway in 
the piastids, leaves accumulated up to 14% PHB per gram dry weight (Nawrath, C, et ah, Proc. 
Natl. Acad Sci. U.S.A. 91: 12760-12764, 1994). High-level synthesis of PHB in plants opened 
the possibility of utilizing agricultural crops as a suitable system for the production of 
poiyhydroxyalkanoates on a large scale and at low cost (Poirier, Y. et ah. Biotechnology 13: 
143-150, 1995; Poirier, Y. et ah, FEMS Microbiol Rev. 103: 237-246, 1992; Nawrath, C, et a3, 
Molecular Breeding I: 105-22, 1995). PHB was also shown to be synthesized in insect cells 
expressing a mutant fatty acid synthase (Williams, M. D., et ah, Appl Environ. Microbiol. 62: 
2540-2546, 1996), and in yeast expressing the it. eutropha PHB synthase (Leaf, T. A., et al. 
Microbiol 142: 1169-1180, 1996). 

A number of pseudomonads, including Pseudomonas putida and Pseudomonas 
aeruginosa, accumulate MCL-PHAs when cells are grown on alkanoic acids (Anderson, A. J. & 
Dawes, E. A. Microbiol. Rev. 54: 450-472. 1990; Stetnbfichel, A. in Novel Biomaterlals from 
Biological Sources, eel Byrom, D. (MacMiihm, New York), pp. .123-213, 1991; Poirier, Y.. 
Nawrath, C. & Somerviiie, C. Biotechnology 13: 143-150, 1995). The nature of the PHA 
produced is related to the substrate used for growth and is typically composed of monomers 
which are 2n carbons shorter than the substrate. These studies indicate thai MCL-PHAs are 
synthesized by the PHA synthase from 3-hydroxyacyl-CoA intermediates generated by the p- 
oxidaticm of alkanoic acids (Huijberts, G. N. M, et al Appl Environ. Microbiol 58: 536-544, 
1992; Huijbens, (1 N. M., et aL, ,/. Bacterid 176: 1661-1666, 1994). 

Chen et al, (Nature Biotech., 16: 1060-3064, 199S; reviewed by Gelvin, S.B., Nature 
Biotech., 16: 10094010, 1998) describes the cobomhardrnent of embryogenic rice tissues with a 
mixture of 14 different pUC based piasmids. Integration of multiple transgenes was observed to 
occur at one or two genetic loci. 

Creating a transgenic host cell or plant that produces multiple enzymes within a 
biosynthetic pathway is often a daunting task. Individual vectors must be created for each 
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enzyme. Transformation of the host cell or plant is typically accomplished by one of three 
general methods: serial transformation, parallel transformation followed by crossing, or batch 
transformation. Each method has serious practical drawbacks. 

Serial transformation involves transforming a host cell or plant with the first vector, 
selecting and characterizing the transformed cell or plant, transforming with the second vector, 
and so on. This process can become quite laborious and time consuming. 

Parallel transformation followed by crossing involves separately transforming cells with 
each of the individual vectors, and subsequently mating or crossbreeding the transformed cells or 
plants to obtain a final ceil or plant which contains all of the individual sequences. This is a 
lengthy process, especially for the crossbreeding of plant lines. 

Batch transformation involves a single transformation event involving ail of the 
individual vectors. A wide array of cells are produced, each containing between none and ail of 
the vectors. While only a single transformation is required, extensive characterization of the 
resulting cells is necessary. As the number of vectors increases, it is increasingly likely that no 
cells will be obtained containing all of the vectors. If no desired transformed cells are identified, 
the transformation must be repeated. 

An additional concern with alj three of these methods is that they do not allow any 
control over the relative copy numbers of the individual vectors in the transformed ceil or plant. 
It would be desirable to have a transformation method that permits control of the relative copy 
numbers of the individual sequences in the transformed cell or plant, and also coordinates the 
positional effect of the insertion locus. 

There exists a need for improved materials and methods for the preparation of transgenic 
organisms transformed with multiple nucleic acid sequences encoding members of a multi- 
enzyme binsymhetic pathway. 

SUMMARY OF THE INVENTION 



"nvtMion involves tne construction and use of nucleic acid segments and 
containing multiple sequences encoding members of a biosynthetic pathway, The i 
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copies of each coding sequence within the constructed vector. Single or multiple copies of each 
coding sequence may easily be designed into the vector. 

An unexpected beneficial result of the invention is that organisms transformed with a 
multi-enzyme coding vector produce the biosynthetic product in higher yield than organisms 
produced by serial transformation, parallel transformation with crossing, or batch transformation 
methods. 

DETAILED DESCRIPTION OF THE INVENTION 

The invention is directed generally towards the construction and use of nucleic acid 
segments comprising sequences encoding multiple enzymes in a multi-enzyme biosynthetic 
pathway. The biosynthetic pathway may generally be any biosynthetic pathway. Examples of 
such multi-enzyme biosynthetic pathways are the TCA cycle, polyketide synthesis pathway, 
carotenoid synthesis, glycolysis, gluconeogenesis, starch synthesis, lignins and related 
compounds, production of small molecules that serve as pesticides, fungicides, or antibiotics, and 
polymer synthesis pathways. Preferably, the biosynthetic pathway is a poiyhydroxyaikanoate 
biosynthesis pathway. 

This disclosure describes muitigene vectors designed to produce poiyhydroxyaikanoate 
(PHA) in plants. Some of these vectors are designed to produce polytji-hydroxybutyrate), and 
some are designed to produce poly(p-hydroxybutyrate-co-p-hydroxyvalerate) (Gruys et a!., WO 
98/00557, 1998), In general, the efficiency of PHA production was dramatically increased when 
all sequences necessary for a pathway were introduced on the same vector. Herein, construction 
of these muitigene vectors, and their use for poiyhydroxyaikanoate production in Arabidopm 
ihaliana and Brassica napus, and lea mays is described. 

An embodiment of the present invention is an isolated nucleic acid segment comprising 
multiple nucleic acid sequences, each encoding a different protein within the biosynthetic 
pathway. Preferably, the isolated nucleic acid segment comprises a first nucleic acid sequence 
encoding a polyhydroxyalfcanoate synthase protein; a second nucleic acid sequence encoding a 
0-ketoacyl reductase protein; and a third nucleic acid sequence encoding a fTketothioiase 
protein. The nucleic acid segment may further comprise additional nucleic acid sequences 
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encoding additional proteins such as a threonine deaminase protein or a deregulated threonine 
deaminase protein. 

An alternative embodiment of the invention is a recombinant vector comprising multiple 
nucleic acid sequences, each encoding a different protein within the biosynthetic pathway. The 
recombinant vector may be arranged with a single promoter producing a polyeistronic RNA 
transcript from the multiple nucleic acid sequences, or with each nucleic acid sequence being 
under Use control of its own promoter. The multiple promoters may be me same or different, it 
is also possible to have one or more nucleic acid sequence under the control of its own promoter, 
while other nucleic acid sequences may be jointly under the control of a single promoter 
paiducing a polyeistronic RNA transcript, 

A recombinant vector placing the biosynthetic pathway nucleic acid sequences under the 
control of a single promoter preferably comprises operatively linked in the 5' to 3' direction: a 
promoter that directs transcription of the fust nucleic acid sequence, second nucleic acid 
sequence, and third nucleic acid sequence; a first nucleic acid sequence; a second nucleic acid 
sequence; a third nucleic acid sequence; a 3' transcription terminator; and a 3' polyadenylation 
signal sequence; wherein: the first nucleic acid sequence, second nucleic acid sequence, and third 
nucleic acid sequence encode different proteins; and the first nucleic acid sequence, second 
nucleic acid sequence, and third nucleic acid sequence are independently selected from the group 
consisting of a nucleic acid sequence encoding a polyhydroxyalkanoate synthase protein, a 
nucleic acid sequence encoding a jS-ketoacyi reductase protein, and a nucleic acid sequence 
encoding a fJ-ketothioiase protein. The nucleic acid sequences encoding the biosynthetic 
pathway enzymes may be in any order relative to each other and the promoter. The promoter 
must be expressed in plastids. It may have either been derived from a piastid, or may have been 
derived from a bacterium or phage having promoters recognized by the piastid transcription 
enzymes, or be a synthetic promoter recognized by the piastid transcription enzymes. 

A recombinant vector placing the biosynthetic pathway nucleic acid sequences under the 
control of multiple promoters preferably comprises a first element comprising operatively linked 
in the 5' to 3' direction: a first promoter that directs transcription of the first nucleic acid 
sequence; a first nucleic acid sequence encoding a polyhydroxyalkanoate synthase protein; a first 
T transcription terminator; a first 3' polyadenylation signal sequence; a second element 
comprising operatively linked in the 5' to 3' direction: a second promoter that directs 



WO {JO/52183 



PCT/USO0/OS93J 



transcription of the second nucleic acid sequence; a second nucleic acid sequence encoding a [V 
ketoacyl reductase protein; a second 3* transcription terminator; a second 3' poiyadeny.la.ti on 
signal sequence; and a third element comprising operatively linked in the 5' to 3* direction: a 
third promoter that directs transcription of the third nucleic acid sequence; a third nucleic acid 
sequence encoding a p-ketofhiolase protein; a third 3* transcription terminator; and a third 3' 
polyadenyiation signal sequence. The f>ketothioiase protein preferably condenses two 
molecules ofacetyl-CoA to produce acetoaeeryl-CoA: and condenses acetyi-CoA and propionyi- 
CoA to produce p-ketovaleryf-CoA. The p-ketoacyi reductase protein preferably reduces 
acetoacetyl-CoA to pMiydroxybutyryl-CoA; and reduces (J-ketovaieryl-CoA to p- 
hydroxyvaieryl-CoA. The polyhydroxyaikanoate synthase protein is preferably selected from 
the group consisting of: a polyhydroxyaikanoate synthase protein that incorporates (J- 
hydroxybutyryl-CoA into P(3HB) polymer; and a polyhydroxyaikanoate synthase protein thai 
incorporates a p-hydroxybtttyryit-CoA and a p-hydtx^xyvaieryl-CoA into P(3HB-co-3HV) 
copolymer. The ^-ketothiolase protein may comprise a transit peptide sequence that directs 
transport of the p-ketotiiiolase protein to the plastki. The [j-ketoacyl reductase protein may 
comprise a transit peptide sequence that directs transport of the p-ketoacyi reductase protein to 
the plasrid. The polyhydroxyaikanoate synthase protein may comprise a transit peptide sequence 
that directs transport of Ac polyhydroxyaikanoate synthase protein to the piastid. The 
recombinant vector may further comprise a nucleic acid sequence encoding a threonine 
deaminase protein or a deregulated threonine deaminase protein. The first promoter, second 
promoter, and third promoter are preferably active in plants. The first promoter, second 
promoter, and third promoter are preferably viral promoters. The first promoter, second 
promoter, and third promoter are preferably independently selected from the group consisting of 
a CMV 35S promoter, an enhanced CMV 35S promoter, maize chlorophyll A/B binding protein 
promoter, and an FMV 35$ promoter. More preferably, the first promoter, second promoter, and 
third promoter are the CMV 35S promoter. The first promoter, second promoter, and third 
promoter may be tissue specific promoters. The first promoter, second promoter, and third 
promoter may independently be tlx- Lewwrella hydroxylase promoter or the 7S congiycinin 
promoter, and preferably each is the Lesquerdla hydroxylase promoter. 
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An alternative embodiment is directed towards transformed host cells. Transformed host 
cells may contain a aon-integrated recombinant vector or an integrated recombinant vector. 

A transformed host cell may comprise a recombinant vector, wherein the recombinant 
vector comprises a first element comprising operatively linked in the 5' to 3' direction: a first 
promoter that directs transcription of the first nucieic acid sequence; a first nucleic acid sequence 
encoding a polyhydroxyalkanoate synthase protein; a first 3' transcription terminator; a first 3> 
poiyadenylation signal sequence; a second element comprising operatively linked in the 5' to 3' 
direction: a second promoter that directs transcription of the second nucleic acid sequence; a 
second nucleic acid sequence encoding a p-ketoacyl reductase protein; a second V transcription 
terminator; a second V poiyadenylation signal sequence; and a third element comprising 
operatively linked in the S' to 3' direction a third promoter that directs transcription of the third 
nucleic acid sequence; a third nucleic acid sequence encoding a p-ketothiolase protein; a third 3> 
transcription terminator; and a third 3' poiyadenylation signal sequence. 

The transformed host cell may alternatively contain an integrated nucleic acid segment. 
Preferably, the transformed host ceil may comprise a first element comprising operatively linked 
in the 5' to 3' direction: a first promoter to directs transcription of a fust nucieic acid sequence; 
a first nucleic acid sequence encoding a polyhydroxyalkanoate synthase protein; a first 3> 
transcription terminator; a first 3' poiyadenylation signal sequence; a second element comprising 
operatively linked in the 5' to 3' direction: a second promoter that directs transcription of a 
second nucleic acid sequence; a second nucleic acid sequence encoding a p-ketoacyi reductase 
protein; a second 3 s transcription terminator; a second 3' poiyadenylation signal sequence; and a 
third element comprising operatively linked in the 5' to 3* direction: a third promoter that directs 
transcription of a third nucleic acid sequence; a third nucleic acid sequence encoding a p- 
ketothiolase protein; a third 3' transcription terminator; and a third 3' poiyadenylation signal 
sequence. The fust element, second element, and third element may be eointegrated within & 
continuous 10 Mb segment of genomic DNA, more preferably within a continuous 5 Mb, 2.5 
Mb, 2 Mb, 1.5 Mb, 1 Mb, 500 kb,250 kb, 100 kb, 50 kb, or 20 kb segment of genomic DNA 
Alternatively, the first element, second element, and third element may be eointegrated between 
a left Ti border sequence and a right Ti border sequence. While it is preferable that a 
recombinant vector contain a single left ft border sequence and a single right Ti border 
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sequence, the invention encompasses recombinant vectors containing multiple left and/or right Ti 
border sequences, and the use thereof. 

Alternatively, the host cell may comprise a nucleic acid segment containing nucleic acid 
sequences encoding enzymes in a biosynthetie pathway, where a single promoter directs 
transcription of the nucleic add sequences. 

The transformed host cell may generally be any host cell, and preferably is a bacterial, 
fungal, or plant cell The bacterial ceil is preferably an Escherichia colt cell. The fungal cell is 
preferably a yeast, Saceharomyces cerevisiae, or Schimsaccharomyces pombe cell. The plant 
cell may be a monocot plant ceil, a dicot plant cell, an algae cell, or a conifer plant ceil. The plant 
cell is preferably a tobacco, wheat, potato, Arabuktpsh, corn, soybean, eaaoia, sugar beet, oil 
seed rape, sunflower, flax, peanut sugarcane, swttchgrass, or alfalfa cell. 

The promoters may be any of the promoters discussed earlier. The transformed host cells 
preferably produce polyhydroxyalkanoate polymer. 

The invention also encompasses transformed plants. The transformed plant may contain 
an integrated set of nucleic acid sequences, or may contain the same set of nucleic acid 
sequences on a non-integrated vector. A preferred embodiment is directed towards a 
transformed plant comprising a fust element comprising operatively linked in the 5' to T 
direction: a first promoter that directs transcription of a fust nucleic acid sequence; a first nucleic 
acid sequence encoding a polyhydroxyalkanoate synthase protein; a first V transcription 
terminator; a first 3' poiyadenylatkm signal sequence; a second element comprising operatively 
linked in the 5' to 3' direction: a second promoter that directs transcription of a second nucleic 
acid sequence; a second nucleic acid sequence encoding a p-keioacyl reductase protein; a second 
3' transcription terminator; a second 3 ! poiyadenyktion signal sequence; and a third element 
comprising operatively linked in the 5' to 3 ! direction: a third promoter that directs transcription 
of a third nucleic acid sequence; a third nucleic acid sequence encoding a p-keiothiolase protein; 
a third 3' transcription terminator; and a third 3' poiyadenyktion signal sequence. The first 
element, second element, and third element may be cointegrated within a continuous 10 Mb 
segment of genomic DMA, more preferably within a continuous 5 Mb, 2.5 Mb, 2 Mb. 1.5 Mb, 1 
Mb 500 kb, 250 kb, 1 00 kb, 50 kb, or 20 kb segment of genomic DNA. Alternatively, the first 
element, second element, and third element may be cointegrated between a left Ti border 
sequence and a right Ti border sequence. 
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Aftemativeiy, the transformed plant may comprise a nucleic add segment containing 
nucleic acid sequences encoding enzymes in a biosynthetic pathway, where a single promoter 
directs transcription of the nucleic acid sequences. 

The transformed plant may generally be any type of plant, and preferably is a tobacco, 
wheal, potato, Anibkhpsis, corn, soybean, canola, oil seed rape, sunflower, flax, peanut, 
sugarcane, switchgrass, or alfalfa plant. 

The promoters may be any of the promoters discussed earlier. The transformed plant 
preferably produces polyhydroxyalkanoate polymer. 

The invention also encompasses methods of preparing transformed host cells. The 
methods may produce a transformed host cell having nucleic acid sequences under the control of 
multiple promoters or under the con trol of a single promoter. The method preferably comprises 
the steps of selecting a host cell: transforming the selected host cell with a recombinant vector 
comprising; a first element comprising operatively linked in the 5' to 3' direction: a first, 
promoter that directs transcription of the first nucleic acid sequence; a first nucleic acid sequence 
encoding a polyhydroxyalkanoate synthase protein; a first 3' transcription terminator; a first 3' 
polyadenylatton signal sequence; a second element comprising operatively linked in the 5' to 3' 
direction: a second promoter that directs transcription of the second nucleic acid sequence; a 
second nucleic acid sequence encoding a (3-ketoacyi reductase protein; a second IV transcription 
terminator; a second 3' polyadenylatton signal sequence; and a third element comprising 
operatively linked in the 5' to 3' direction: a third promoter mat directs transcription of the third 
nucleic acid sequence; a third nucleic acid sequence encoding a fS-ketothioiase protein; a Third 3* 
transcription terminator; and a third 3' poiyadenylation signal sequence; and obtaining 
transformed host cells; wherein the transformed host cells produce polyhydroxyalkanoate 
polymer. 

Alternatively, the method of preparing transformed host cells may comprise the steps of 
selecting a host cell; transforming the selected host cell with a recombinant vector comprising 
operatively linked in the 5' to 3' direction; a promoter that directs transcription of a first nucleic 
acid sequence, second nucleic acid sequence, and third nucleic acid sequence; a first nucleic acid 
sequence; a second nucleic acid sequence; a third nucleic acid sequence; a 3' transcription 
terminator; and a 3' poiyadenylation signal sequence; and obtaining transformed host cells; 
wherein: the first nucleic acid sequence, second nucleic acid sequence, and third nucleic acid 
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sequence encode different proteins; the first nucleic acid sequence, second nucleic acid sequence, 
and third nucleic acid sequence are independently selected from the group consisting of a nucleic 
acid sequence encoding a polyhydroxyalkanoate synthase protein, a nucleic acid sequence 
encoding a {J-ketoacyj reductase protein, and a nucleic acid sequence encoding a ft-ketothioiase 
protein; and the transformed host cells produce polyhydroxyalkanoate polymer. 
The promoters may be any of the promoters discussed earlier. 

Also disclosed are methods for preparing transformed plants. The methods may produce 
a transformed plant having nucleic acid sequences under the control of multiple promoters or 
under the control of a single promoter. The method preferably comprises the steps of selecting a 
host plant cell; transforming the selected host plant cell with a recombinant vector comprising: a 
first element comprising operatively Jinked in the 5' to 3' direction: a first promoter that directs 
transcription of a first nucleic acid sequence; a first nucleic acid sequence encoding a 
polyhydroxyalkanoate synthase protein; a first 3' transcription terminator; and a first. 3' 
polyadenyiation signal sequence; a second element comprising operatively linked in the 5* to 3' 
direction: a second promoter that directs transcription of a second nucleic acid sequence; a 
second nucleic acid sequence encoding a p-ketoacyl reductase protein; a second 3' transcription 
terminator; and a second 3' polyadenyiation signal sequence; and a third element comprising 
operatively linked in the 5' to 3' direction: a third promoter that directs transcription of a third 
nucleic acid sequence; a third nucleic acid sequence encoding a p-ketothioSase protein; a third 3" 
transcription terminator; and a third 3' polyadenyiation signal sequence; obtaining transformed 
host plant cells; and regenerating the transformed host plant cells to produce transformed plants, 
wherein the transformed plants produce polyhydroxyalkanoate polymer. 

Alternatively, the method of preparing a transformed plant may comprise the steps of 
selecting a host plant cell; transforming the selected host plant cell with a recombinant vector 
comprising operatively linked in the 5' to 3* direction: a promoter that directs transcription of a 
first nucleic acid sequence, second nucleic acid sequence, and third nucleic acid sequence; a first 
nucleic acid sequence; a second nucleic acid sequence; a third nucleic acid sequence; a 3' 
transcription terminator; and a 3' polyadenyiation signal sequence: obtaining transformed host 
plant cells: and regenerating the transformed host plant cells to produce transformed plants; 
wherein; the first nucleic acid sequence, second nucleic acid sequence, and third nucleic acid 
sequence encode different proteins; the first nucleic acid sequence, second nucleic acid sequence. 
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and third nucleic acid sequence are independently selected from the group consisting of a nucleic 
acid sequence encoding a polyhydroxyalkaooate synthase protein, a nucleic acid sequence 
encoding a p-ketoacyi reductase protein, and a nucleic acid sequence encoding a p-ketothiolasc 
protein; and the transformed plants produce polyhydroxyalkanoate polymer. 
T he promoters may be any of the promoters discussed earlier. 

The invention is also directed towards methods of producing biomolecuies of interest. 
The multiple enzymes in the biosyntbetic pathway may lead to the production of materials of 
commercial and scientific interest. Preferably, the biomolecuies are polymers, and more 
preferably are polyhydroxyalkanoate polymers. The methods; may comprise obtaining any of the 
above described transformed host cells or transformed plants, cultnring or growing the 
transformed host ceils or transformed plants under conditions suitable for the production of 
polyhydroxyalkanoate polymer, and recovering polyhydroxyalkanoate polymer. The methods 
may further comprise the addition of nutrients, substrates, or other chemical additives to the 
growth media or soil to facilitate production of polyhydroxyalkanoate polymer. In a preferred 
embodiment, it is possible to extract the polyhydroxyalkanoate from the transformed host cells or 
transformed plants without killing the host ceils or plants. This may be accomplished, for 
example, by various solvent extraction methods or by engineering the host cells or plants to 
secrete the polyhydroxyalkanoate polymer, or by directing production to tissues such as leaves or 
seeds which may be removed without causing serious injury to the plant. The 
polyhydroxyalkanoate polymer produced is preferably poiy(3-hydroxybutyrate), poly(3- 
hydroxybutyrate-co-3-hydroxyvalerate), poiy(4-hydroxybutyraie), or poly(3-hydn>xybutyrate- 
co-4-hydroxybutyrate). 

If repetitive sequences are used in a multi-gene piasmid system, there exists the 
possibility tor gene silencing in subsequent generations of plants. If expression levels are high 
gene silencing could also occur and would be independent of repetitive elements. Repetitive 
sequences may include the use of the same promoters, chloropiast peptide encoding sequences, 
and other genetic elements for each of the multi-gene coding sequences. Gene silencing often 
manifests itself as a gradual reduction in protein levels, mRNA levels, or biosynthesis product 
concentrations in subsequent generations of related plants, if gene silencing is observed, 
changing the repetitive sequences through the use of diverse genetic elements such as different 
promoters, leaders, introns, transit peptide sequences, etc., different designed nucleotide 
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sequence, or through, mutagenesis of the existing sequence, may be successful in reducing or 
eliminating the gene silencing effects. 

DESCRIPTION OF THE FIGURES 

The following figures form part of the present specification and are included to further 
demonstrate certain aspects of the present invention. The invention may be better understood by 
reference to one or snore of the.se drawings in combination with the detailed description of 
specific embodiments presented herein. 

Figure 1 : Biosynthesis of poly(p-hydroxybutyrate-co-p-hydroxyvalerate) <poly(3HB-co- 

3HV), PHBV) in Raisfoma euiropha. 
Figure 2: Plant transformation strategies for multi-enzyme metabolic pathway engineering. 
Figure 3: Plasmid map of pMON25642. A list of the restriction enzyme cutting sites for 

pMON25642 is provided in Table 10. 
Figure 4: Plasmid map of pMONi0098. A list of the restriction enzyme cutting sites for 

pMONI 0098 is provided in Table 3 1 . 
Figure 5: Plasmid map of pMON969. A list of the restriction enzyme cutting sites for 

pMON969 is provided in Table 12. 
Figure 6: Plasmid map of pMON25661. A list of the restriction enzyme cutting sites for 

pMON2566i is provided in Table 13. 
Figure 7: Plasmid map of pMON25897. A list of the restriction enzyme cutting sites for 

pMON25897 is provided in Table 14, 
Figure 8: Plasmid map of pMON25662. A list of the i !!!•..'! i -a n cui i>z siu t 

pMON25662 is provided in Table 1 5. 
Figure 9: Plasmid map of pMON25663. A list of the restriction enzyme cutting sites for 

pMON25663 is provided in Table 16. 
Figure 1 0: Plasmid map of pMON25943. A list of the restriction enzyme cutting sites for 

pMON25943 is provided in Table 1 7. 
Figure 1 1 : Plasmid map of pMON25948. A list of the restriction enzyme cutting sues for 

pMON25948 is provided in Table 1 8. 
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Figure 1 2: Piasmid map of pMON25949. A list of the restriction enzyme cutting sites for 

pMON25949 is provided in Tabic 19. 
Figure 13: Piasmid map of pMON25951. A list of the restriction enzyme cutting sites for 

pMON2595 1 is provided in Table 20. 
Figure 14: Piasmid map of pMON34545. A list of the restriction enzyme cutting sites for 

pMON34545 is provided in Table 21 . 
Figure 15: Piasmid map of pMON34565. A list of the restriction enzyme cutting sites for 

pMON34565 is provided in Table 22. 
Figure 16: Piasmid map of pMON25995. A list of the restriction enzyme cutting sites for 

pMON25995 is provided in Table 23. 
Figure 1 7: Piasmid map of pMGN2$973. A list of the restriction enzyme cutting sites for 

pMON25973 is provided in Table 24, 
Figure 1 8: Piasmid map of pMON259S7. A list of the restriction enzyme cutting sites for 

pMON2598? is provided in Tabic 25. 
Figure 19: Piasmid map of pMON25991 . A list of foe restriction enzyme cutting sites for 

PMON2599J is provided in Table 26. 
Figure 20: Piasmid map of pMON25992. A list of the restriction enzyme cutting sites for 

pMON25992 is provided in Table 27. 
Figure 21 : Piasmid map of pMON25993 . A list of the restriction enzyme cutting sites for 

pMGN25993 is provided in Table 28. 
Figure 22: Piasmid map of pMON36805. A list of the restriction enzyme cutting sites for 

PMON36805 is provided in Tabic 29. 
Figure 23; Piasmid map of pMON36814. A list of the restriction enzyme cutting sites for 

pMON36814 is provided in Table 30. 
Figure 24: Piasmid map of pMON368I6. A list of the restriction enzyme cutting sites for 

pMON3681 6 is provided in Table 3 1 . 
Figure 25; Piasmid map of pMON36824. A list of the restriction enzyme cutting sites for 

PMON36824 is provided in Table 32. 
Figure 26: Piasmid map of pMON36843. A list of foe restriction enzyme cutting sites for 
PMON36843 is provided in Table 33. 
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Figure 27: Pbsmid map of pMON34543. A list of the restriction enzyme cutting sites for 

pMON34543 is provided in Table 34, 
Figure 28: Plasmid map of pMON36850. A list of the restriction enzyme cutting sites for 

pMON36850 is provided in Table 35. 
Figure 29: Plasmid map of pMON25963. A list of the restriction enzyme cutting sites for 

pMON2S963 is provided in Table 36. 
Figure 30: Plasmid map of pMON2S965. A list of the restriction enzyme cutting sites for 

PMON2596S is provided in Table- 37. 
Figure 3 1 : Method for creating multi-gene vectors. 

Figure 32: PHB biosynthetic pathway. PHB production requires the condensation of two 
acctyi-CoA molecules using a p-ketothiolase, a D~isomer~specifte reduction by 
acetoaeetyl-CoA reductase, and PHB polymerization by PHB synthase. The 
genes encoding these enzymes are indicated in parentheses. 

Figure 33 : Schematic diagram of multi-gene vector used to transform Enmica napus. 

Vectors were constructed using modular cassettes. Each cassette consists of the 
lesquerella hydroxylase promoter (P-Lh), a chloropiast transit peptide (ctp) fused 
to an open reading frame encoding a PHB synthesis enzyme, and the E9 V 
terminator. The plasmid also expresses EPSP synthase to provide resistance to 
glyphosate, contains bacterial replication origins, and a bacterial ly-expressed gene 
encoding resistance to streptomycin and spectinomycin. in pMON36814, hklB 
was replaced w&phbA. Otherwise, the vectors were identical. RB. right border 
of T-DNA; LB, left border of T-DNA. 

Figure 34: Electron micrographs of BrassUxt napus plastids. Panel A: Leukoplast from wild 
type Brassica napus seed. Panel B: Leukoplast from Brassica napus seed 
producing PHB. Polymer (PHB) and oii bodies (O) are indicated. Note the 
greatly expanded size of leukoplasts in the PHB-produeing line. 

Figure 35: A pathway designed to produce poiy(^ydroxybutyrute-eo-^hydroxyvalerate) in 
the plastids of plants. Propionyl-CoA is derived from threonine via threonine 
deaminase and the pyruvate dehydrogenase complex. Acetyl-CoA is drawn from 
normal intermediary metabolism. The pathway requires transformation of the 
plant with four genes tiivA, bktB. phbB. and phbC). and relies on endogenous 
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pyruvate dehydrogenase. Ail enzymes encoded by transgenes are targeted to the 
plastid using chioroplast transit peptides. 

Figure 36: Concentrations of selected 2-fceto acids and amino acids in control plants and in 
Arabidopsis expressing threonine deaminase. (A) Comparison of pyruvate and 2- 
ketobutyrate concentrations in Arabidopsis harboring either a control plasmid or a 
plasmid expressing wild type £ colt iJvA (threonine deaminase). (8) 
Comparison of threonine, isoieucine, and 2-ketobutyrate concentrations in 
Arabidopsis harboring either a control plasmid or a plasmid expressing wild type 
E. coli ilvA. Note the different scales used in parts (A) and (B). 

Figure 37; 1 3 C NMR spectra demonstrating poly(p-hydroxy btityrate-co-p-hydroxy valerate) 
copolymer production in transgenic Arabidopsis, Note the presence of signals 
indicating presence of both 3 -hydroxy butyrate and 3-hydroxy valerate side chains, 

Figure 3 8 ; Analyses of total polymer production, the 3-hydroxyvaterate fraction of the 

polymer, and the activity of threonine deaminase Brmsiea oilseeds synthesizing 
PHBV copolymer. Note the distinct negative correlation between polymer 
concentration and the 3-HV content of the polymer. Abo note that increasing 
threonine deaminase activity does not lead to increased 3-HV content. 

Figure 39: Multiple potential routes to produce propionyl-CoA in planla. Most alternative 
pathways have the potential to produce propionyl-CoA in plants. However, 
production of propionyl-CoA from threonine provides the most direct route. 

Figure 40: Bar graph of average % PHA produced from Arabidopsis transformation 
methods. 

Figure 41: Bar graph of average % PHA produced from eanoia transformation methods. 
Figure 42: Bar graph of maximum % PHA produced from Arabidopsis transformation 
methods. 

Figure 43 ; Bar graph of maximum % PHA produced from canola transformation methods, 
DEFINITIONS 

The following definitions are provided in order to aid those skilled in the an. in 
understanding the detailed description of the present invention. 
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"Acyl-ACP thioesterase" refers to protein? which catalyze the hydrolysis of aeyl-ACF 
thioesters. 

-terminal region'' refers to the region of a peptide, polypeptide, or protein chain from 
the middle thereof to the end that carries the amino acid having a free a carboxyl group (the C- 
5 terminus). 

"CoA" refers to coenzyme A. 

The phrases "coding sequence", "open reading frame", and "structural sequence" refer to 
the region of continuous sequential nucleic acid triplets encoding a protein, polypeptide, or 
peptide sequence. 

a) The term "encoding DNA" or "encoding nucleic acid" refers to chromosomal nucleic 

acid, piasmid nucleic acid, cDNA, or synthetic nucleic acid which codes on expression for any of 
the protejns or fusion proteins discussed herein. 

"Fatty acyl hydroxylase" refers to proteins which catalyze the conversion of fatty acids to 
hydroxylated fatty acids. 

ss fhe ten geno it > o bacteria encornf k) e( m< 'ie and 

plasmids within a bacterial host cell Encoding nucleic acids of the present invention introduced 
into bacteriai host cells can therefore be either chromosomaily-integrated or piasmid-iocaliz.ed. 
The term "genome" as it applies to plant cells encompasses not only chromosomal DNA found 
within the nucleus, but organelle DNA found within subcellular components of the cell. Nucleic 

20 acids of the present invention introduced into plant cells can therefore be either chromosornally- 
integ; tted or oi 1 n lie localized 

"Identity" refers to the degree of similarity between two nucleic acid or protein 
sequences. An alignment of the two sequences is performed by a suitable computer program. A 
widely used and accepted computer program for performing sequence alignments is 

25 CLUSTALW vl.6 (Thompson, et al. Nud. Acids Res., 22: 4673-4680, 1994}. The number of 
matching bases or amino acids is divided by the total number of bases or amino acids, and 
multiplied by 100 to obtain a percent identity. For example, if two 580 base pair sequences had 
145 matched bases, they would be 25 percent identical If the two < | lences are oi 

different lengths, the number of matches is divided by the shorter of the two lengths. For 

so example, if there were 100 matched amino acids between 200 and a 400 amino acid proteins, 
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they are 50 percent identical with respect to the shorter sequence. If the shorter sequence is less 
than 150 bases or 50 amino acids in length, the number of matches are divided by 150 (for 
nucleic acids) or 50 (for proteins); and multiplied fay 100 to obtain a percent identity. 

The terms "microbe" or "microorganism" refer to algae, bacteria, fungi, and protozoa. 
"N-terminai region" refers to die region of a peptide, polypeptide, or protein chain from 
the amino acid having a free a amino group to the middle of the chain. 

"Nucleic acid" refers to ribonucleic acid (RNA) and deoxyribonucleic acid (DNA). 
A "nucleic acid segment" is a nucleic acid molecule that lias been isolated free of total 
genomic DNA of a particular species, or that has beer, synthesized, Included with the term 
"nucleic acid segment" are DNA segments, recombinant vectors, plasmids, crowds, phagemids, 
phage, viruses, etcetera. 

"Overexpression" refers to die expression of a polypeptide or protein encoded by a DNA 
introduced into a host cell, wherein said polypeptide or protein is either not normally present in 
the host cell or wherein said polypeptide or protein is present in said host ceil at a higher level 
than that normally expressed from the endogenous gene encoding said polypeptide or protein. 

The terra "plastid" refers to the class of plant ceil organelles that includes amyfoplasts, 
chloroplasts, chromopksts, elaiopiasts, eoplasts, etiopiasls, leucoplasts, and proplastids. These 
organelles are self-replicating, and contain what is commonly referred to as the "chloropiast 
genome," a circular DNA molecule that ranges in size from about 3 20 to about 217 fcb, 
depending upon the plant species, and which usually contains an inverted repeat region (Fosket, 
Plant growth and Development, Academic Press, inc., San Diego, CA, p. 132, 1994). 

"Polyadenylation signal" or "polyA signal"' refers to a nucleic acid sequence located 3" to 
a coding region that directs the addition of adenylate nucleotides to the 3' end of the mRNA 
transcribed from the coding region. 

The term '"poiyhydroxyaikanoate (orPHA) synthase" refers to enzymes that convert 
hydroxyacyi-CoAs to poiyhydroxyalkanoates and free CoA. 

The term "promoter" or "promoter region" refers to a nucleic acid sequence, usually 
found upstream (5') to a coding sequence, that controls expression of the coding sequence by 
controlling production of messenger RNA (mRNA) by providing the recognition site for RNA 
polymerase and/or other factors necessary for start of transcription at the correct site. As 
contemplated herein, a promoter or promoter region includes variations of promoters derived by 
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means of ligation to various regulatory sequences, random or controlled mutagenesis, and 
addition or duplication of enhancer sequences. The promoter region disclosed herein, and 
biologically functional equivalents thereof, are responsible for driving the transcription of coding 
sequences under their control when introduced into a host as part of a suitable recombinant 
vector, as demonstrated by its ability to produce roRNA. 

"Regeneration 5 ' refers to the process of growing a plant from a plant cell (e.g., plant 
protoplast or explant). 

"Transformation" refers to a process of introducing an exogenous nucleic acid sequence 
(e.g., a vector, recombinant nucleic acid molecule) into a cell or protoplast in which that 
exogenous nucleic acid is incorporated into a chromosome or h capable of autonomous 
replication, 

A "transformed cell" is a ceil whose nucleic acid has been altered by the introduction of 
an exogeno us nucleic acid molecule into that cell, 

A "transformed plant" or "transgenic plant" is a plant whose nucleic acid has been altered 
by the introduction of an exogenous nucleic acid molecule into that plant, or by the introduction 
of an exogenous nucleic acid molecule into a plant cell from which the plant was regenerated or 
derived. 

The following examples are included to demonstrate preferred embodiments of the 
invention, It should be appreciated by those of skill in the an that the techniques disclosed in the 
examples which follow represent techniques discovered by the inventors to function well in the 
practice of the invention, and thus can be considered to constitute preferred modes for its 
practice. However, those of skill in the art should, in light of the present disclosure, appreciate 
that many changes can be made in the specific embodiments which are disclosed and still obtain 
a like or similar result without departing from the spirit and scope of the invention. 

EXAMPLES 

f - iM.ELj j < its ■ >1 nuclei c acid seq uences 

Nucleic acid sequences encoding the polyhydroxyalfcanoate biosynthetic pathway 
include; phbA and phbB (GcnBank accession number J04987), phbC (GenBank accession 
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nuinber J05003), and hktB (GenBanfc accession number AF026544). Production of PHBV 
copolymer can be accomplished by also expressing E. colt itvA (GenBank accession number 
1100096, overlapping base 3953951 :Gruys et al. WO 98/00557). The Ti DNA left border 
sequence Is described in Baker, R.F., et al (Plant Mot. Biol, 2: 335-350, 1983). The Ti DNA 
right border sequence is described in Depicker et al. (,/. Mol. Ap>>. Genet. 1 : 561, 1982). 

[ 7 s N " 1 ' b 1 ' ath Arabicbpsis p lants 

Polyhydroxyalkanoates are a form of polyester accumulated by numerous bacterial 
species as a carbon and energy repository. This class of polymer also has useful thermoplastic 
properties, and is therefore of interest as a biodegradable plastic. Pely$-hydroxybutyrate-co-f3- 
hydroxyvalerate) (poly(3HB-co-3HV), PHBV), a form of PHA, is commercially produced via 
fermentation of Rahionia eutrapha (Figure 1). However, it is expected that the cost of 
production could be dramatically decreased if PHA could be produced in transgenic plants. The 
first attempts at PHA production in plants utilized transgenic Arabidopsls expressing the three 
genes required for the homopolymer poly-{3-hydroxybutyrate (PHB) (Nawrath, C. et al, Proc. 
Mail Acad Set USA. 91: 12760-12764, 1994). In this work, the authors transformed 
Arabkfopsis plants with three independent gene cassettes and crossed the plants using traditional 
breeding methods. They reported PHB production up to 14% of the eel! dry weight However, 
this method took a significant amount of time before the three gene pathway could be assembled. 
In addition, the plants did not maintain a stable phb* phenotype, as determined by our analysis of 
the progeny of these original plants {Table 1). This problem may be due to co-suppression 
(Finnegan, I, and D. McElroy. Bio/Technology. 12: 883-888, 1994), or to segregation of high- 
producing insertions in the progeny. The plants produced by Mawrath et a!, were not fully 
characterized genetically, although it is known that all contained multiple insertions of the 
transgenes. 
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Table 1 . Enzyme activity and polymer data of progeny of Nawrath Arabidopsis lines. 
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LXAMPLE 3; Csg_oj .multiple vecto rs to introduc e I'll \ mosyntheass sequences into 

One vector was constructed containing sequences encoding both acetoaeetyl-CoA 
reductase and PHB synthase proteins. A second vector was constructed containing a sequence 
encoding a Ncetothtolase protein. Two independent transformation events were obtained 
corresponding to each of these vectors. The complete pathway was assembled into a single plant 
using traditional cross-breeding methods. In ail cases, plants exhibiting Mendelian segregation 
consistent with transgene insertion at a single locus were chosen. The results of these 
experiments are shown in Table 2. 

The second strategy pursued was to simultaneously co-transform both plasmids into a 
single plant (simultaneous co-transformation) and assay the primary transformant for polymer 
accumulation, or to re-transform plants that already harbored a single vector (serial co- 
transformation). The results of these experiments are summarized in Table 3. Although the 
activity of enzymes expressed from the encoding sequences was comparable to that reported by 
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Nawrath et ah, none of (he plants generated reached the polymer levels reportedly achieved in 
their study. Neither their experiments nor these results correlate enzyme activity with the 
intracellular concentration ofPHA polymer (Nawrath, C. et ai., Proc. Nail Acad, Set USA, 91: 
12760-12764, 1994). 
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Table 2, Polymer data for Arabhiopsix crosses. 
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64/104 plants positive; AVE - average; SD - standard deviation. 
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Table 3. Polymer data for re-transformed and co-transformed Arabidopsis. 
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64/145 plants positive. 

RE indicates that: this vector was used to re-transform a plant line. 
AVE - average. 
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In an attempt to increase the speed and simplicity of genetic analysis, multigene vectors 
were constructed containing the entire PHB biosynthetic pathway on a single pksmid. 
MuHigene vectors for PHA production in Arabidopsis were constructed from a series of base 
vectors, each with the desired open reading frame under control of the e35s promoter (Odeli, 
J.T., etaL, Ntaure, 313: 810-812, 1985) and the E9 T region (Conizzi, EMBOJ. 3:1671-1679, 
1984), The first vector in this series, pMON25642 (Figure 3) , harbors phbC under control of the 
e35s promoter in pMON10098 (Figure 4), a vector designed for Agrobactermm^dmtcd 
transformation of plants. The remaining intermediate vectors are all derived from pMON969 
(Figure 5\ & high copy-number vector harboring the e35s promoter and the E9 V region. 
Constructs derived from pMON969 include those encoding phbA (pMON25661 ; Figure 6), hkiB 
(PMON25897; Figure 7), phhB (pMON25662; Figure 8), and UvA (pMON25663; Figure 9). 
From these and similar vectors were derived the final plasroids for transformation of 
Arabichpsis; pMON25943 (Figure 10) pMON25948 (Figure 11), pMON25949 (Figure 12\ 
pMON25951 (Figure 13), and pMON34545 (Figure 14). All cloning procedures were performed 
using standard ligation techniques (Samhrook, J.„ et a!\ "Molecular cloning: A laboratory 
manual," Cold Spring Harbor Laboratory Press, Cold Spring Harbor, New York, 1989), except 
that ligation of AtorAcut pMON25949 with the /A^-contatning Not! restriction fragment of 
pMON25663 produced plasmid pMON34565 (Figure 15), that serendipitously contained two 
copies of the UvA fragment. Each copy of UvA contains a SnaBI restriction site, so deletion of a 
3155 bp SnaBI restriction fragment from pMON34565 produced plasmid pMON34545, a 
plasmid wi th a single copy of UvA. 
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The final vectors, pMON25943, pMON25948, pMON25949, pMON25951, and 
pMON34545 were used for AgrobacteHum-mediatsd transformation of Arahidopsis (Bechtold 
N., et al. Complex Rendvs Acad. Set. Paris Sciences Serte HI Sciences de la Vie. 316; 1194- 
1 199, 1993). This approach has proven successful in generating lines with the highest levels of 
PHB obtained to date in our laboratory. PHA production in the plants resulting from the first 
four of these vectors is summarized in Table 4. Data from pMON34545 transformations will he 
obtained. All of the data in Table 4 were derived from heterozygous plants, and the polymer 
concentration may increase once the plants are brought to homozygosity. For example, one plant 
that produced about 7% PI-IB by dry weight when heterozygous produced polymer up to 13% 
when homozygous. 



Table 4. Polymer results from Arabhfopsis derived from multigene vectors. 
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1 53/172 plants positive for PHB; 7 had greater than 4% dry weight; AVE - average; SD « 
standard deviation 



These results demonstrate that use of a multigene vector provides consistently higher 
s of polymer production than were achieved using multiple vectors. The striking beneficial 
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resuks in polymer production obtained from the use ofraukigene vectors are visually displayed 
in Figures 40 and 42. 

There are several possible explanations for the increased levels of polymer present in the 
mukigeoe vector transformants. One explanation derives from the fact that it was possible to 
generate more independent lines with the multigene vectors, and the screening of more plants 
allowed detection of the relatively rare high-producing lines. This is one clear advantage of 
having the entire pathway on a single vector, but the distribution of polymer production in plants 
produced by the various methods suggests that numbers alone do not account for the increased 
polymer production of multigene vectors. It is also possible that having a. metabolic pathway 
genetically linked at a single integration locus is more metabolically favorable due to some level 
of concerted gene expression and/or mRNA metabolism. This phenomenon is common in 
bacteria, but there are not many examples of clustering genes in plants for concerted gene 
expression. Another possibility is that the high local concentration of promoters may lead to 
locally high levels of transcription factors. Still another possibility is that having the genes 
tightly linked may reduce gene silencing, or co-suppression, in certain cases. 

KXAMPLr- , \ Extra cti on of p oly mer irom ArabidopshsaA analysis of polym er 

For isolation of polymer from Arabkhpsis, stems and leaves were harvested and 
dehydrated by lyophilization for approximately 36 hours. The material was ground to a fine 
powder, and 100 rng of powder was treated with 10 ml CLOROX bleach (CXGROX is a 
registered trademark of The Clorox Company, Oakland, CA) for I hour with shaking at room 
temperature. The extract was subjected to centrifugation at 2700 x g for 10 minutes at 4°C, and 
the supernatant solutions was carefully removed. Ten ml. 100% methanol were added, the 
solution was mixed by vortexing, and then centrifuged again. After a second, identical methanol 
extraction, the material was allowed to dry overnight. Polymer was extracted from the dried 
material with 1 ml of chloroform containing 1 pmol/mL methyl-benzoate standard. The tube 
was heated to 100°C for 2.5 hours, solid material was removed by centrifugation, and the 
supernatant material was subjected to methanolysis. Metaanalysis of polymer and gas 
chromatographic characterisation of the methyl-ester residues were performed as described by 
Slater et ah (J. Bacterial 180:1979-1987, 1998). 
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hXAMPI r p T.^ ofjmjhipje s kn ^ i the seeds < >i canola 

Production of poiyhydroxyalfcasioate has also been accomplished within the seed of 
canola (oil seed rape). Initial efforts followed essentially the same strategy as the initial 
Arabidopsis strategy. Thai is, one vector carried the sequences encoding acetoaeetyi-CoA 
reductase and PHA synthase proteins, while another carried the sequence encoding a p- 
ketothiokse protein. However the 7s promoter., which is expressed primarily in the seed, 
replaced the 35s promoter that was used in the Arabidopsis constructs. These 7s promoter 
vectors were used to transform oilseed rape, homozygous lines were crossed, and PHB 
accumulation was assayed in the resulting lines ( Table 5). A number of lines that produce PHB 
were identified, but all produced relatively low concentrations of polymer, with the best lines 
containing about 2% polymer by dry weight. 
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Table 5. Polymer results for canola crosses. 
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Large vectors for expression of multiple genes have also been used to produce 
poiyhydroxyalkanoate in the seeds of canola (oil seed rape), in this ease, the promoter was 
derived from the forty acid hydroxylase gene of Lesguerelia (P4h) (Broun, P. and C, Somerville. 
Plant Physiol. 313; 933-942, 1997), which is expressed primarily within the developing seed. A 
series of vectors, each expressing the entire PHA biosynthesis pathway, was used for 
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transformation of oilseed rape. The muhigene vectors were constructed from a series of base 
vectors, each with the desired open reading frame under control of the Lesqmrella hydroxylase 
promoter (P-ih; Broun, P. and Somervilie } C.R. Plant Physiol, 1 13; 933-942, 1987) and the E9 
y region. The first vector in this series, pMON2599S (Figure 16), harbors phhC under control of 
P-lh in pMON25973 (Figure 17), a vector designed for Agrobaaenum-mtdmlesk transformation 
of plants. The remaining intermediate vectors are all derived from pMON25987 (Figure 1 8), a 
high copy-number vector harboring P-lh and the E9 3' region. Constructs derived from 
pMON259«7 (Figure 16) include those encoding phhA (pMON25991; Figure 19), hklB 
(PMON25992; Figure 20). phbB (pMON25993; Figure 21), and UvA (pMON36805; Figure 22). 
These intermediate vectors were used to construct the final vectors for oilseed rape 
transformation; pMON368I4 (Figure 23), pMON36816 (Figure 24), and pMON36824 (Figure 
25). 

Construction x>f the muhigene vectors for oilseed rape was not as straightforward as was 
the construction of the Arahiebpsls vectors. This was primarily due to the large size of the 
promoter (P-lh is about 2.2 kb), and the resulting larger size of the muhigene vector 
intermediates. As the vectors increased in size, it was found to be most efficient to perform 
ligations of two similar sized fragments, rather than one large vector and one small incoming 
fragment. In addition, it was desirable to avoid partial digests of the large vectors, and to 
perform cloning in which opposite ends of an individual fragment were not compatible. A 
number of intermediate vectors were constructed specifically to allow cloning hi this manner. 
Another advantage of this approach is that it often allowed restriction enzyme-mediated 
digestion of the parental plasmids prior to transformation of Escherichia coli with ligation 
products. 1 his procedure significantly increased the frequency of correct constructs recovered. 
The final vectors were used for Agrobactemm-riKdmed transformation of oilseed rape (Fry , J. 
et ah, Plani Cell Rep. 6: 321-325, 1987), 

The results of oilseed .rape transformation with the muhigene vectors are shown in Table 
6. There are two primary points of interest in these data. First, muhigene vectors larger than 26 
kb were successfully constructed and used to transform oilseed rape, with a very low percentage 
of the plants failing to produce polymer. Second, the distribution of polymer concentrations 
among muhigene vector transformants is higher than that of the plants derived from two separate 
7s vectors. 
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Table 6. Polymer results from caoola transformed with multigene vectors. 
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The comparative results for PHA production in canola are graphically presented in 
Figures 41 and 43. The beneficial results obtained from the use of multigene vectors compared 
to results obtained from traditional methods is visually impressive. 

Since the promoters used in these two vectors sets (those containing the 7s promoter and 
those containing the Usquerella hydroxylase promoter) are different, it cannot be distinguished 
whether it was the lesquereila promoter or fee use of a single vector that led to the increased 
polymer concentration. However, it is clear that the single vector approach is viable for seed 
expression of enzymes, including those required for PHA biosynthesis. In addition, the 
increased speed of plant construction and analysis using a single vector is a clear benefit. 

t^MHI 1 ' fytr acrnn < + rj_SiL- 1 1 v - ^ > 

For isolation of polymer from canola seed, seeds were ground to a fine powder with a 
mortar and pestle. Approximately 200 mg of each sample were extracted two times with 10 ml. 
each of hexane for 1 hour at 60°C, then two times with 10 ml each of 100% methanol for one 
hour at 60°C. This procedure removed oil from the seed. The material was allowed to dry 
completely overnight. Polymer was extracted from the dried material with i mi, of chloroform 
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containing 1 pmoi/ml. methyl benzoate standard. The tube was heated to ?00 a C for 5 hours, 
solid material was removed by centrifugation, and the supernatant: material was subjected to 
methanolysis. Methanolysis of polymer and gas chromatographi c characterization of the methyl- 
ester residues were performed as described by Slater et al. (J. Bacterial. 180: 1979-1987, 1998). 

— - sfl ' ~ 11 ' euu , loi ge ne ex pression in monocots 

For reasons described above, muhtgene vectors will also be desirable for expression of 
multi-enzyme metabolic pathways in monocots. Therefore, vectors designed to produce PHA in 
the leaves of maize were constructed. These vectors use the e35s, eFMV , or maize chlorophyll 
A/B binding protein (P-ChiA/B) promoters, and include the HSP70 intron designed to enhance 
expression in monocots. All enzymes were fused to the Arabidopxis RuSisCo small subunit 
transit peptide. Other promoters might also be used. Examples of vectors designed for gene 
expression in monocots are pMON36843 (Figure 26), pMON34543 (Figure 27), and 
pMON36850 (Figure 28). These vectors have been used to transform maize, and polymer was 
analyzed as described above for Arabidopsis. Polymer production is summarized in Table 7. 



Table 7, Polymer produe 
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Since muitigene vectors are optima! for producing high levels ofPHB, and this strategy is 
potentially optimal for expression of other multiple step pathways, a simple method to produce 
very large, muitigene vectors is preferred. Figures 29 and 30 show pJasmids pMON25963 and 
pMON25965, respectively. These vectors, used together, provide a system for constructing very 
large vectors. Piasmid pMON25965 provides a shuttle vector by which a gene cassette can be 
cloned into the Not! restriction sites and thereby be flanked by a series of restriction sites. These 
restriction sites are relatively rare in many genomes, and thereby of utility for subcioning many 
genes. Piasmid pMON25963 is a binary vector designed for transformation of plants by 
Agrobacterium. It contains a polylinker with the same sites found flanking the Not! restriction 
sites of piasmid pMON25965. Using this system, a series of gene "cassettes'* can be produced 
using piasmid pMON2596S, and each can be sequentially ligated into piasmid pMON25963. 

in practice, a series of vectors similar to pMON25965, but having smaller poiylinkers, 
will be preferred. Specifically, this series of vectors would have a single Not! (or similar 
enzyme) restriction site flanked by one or several other restriction enzyme sites. By iigating 
cassettes flanked by large portions of the pMON25965 polylinker into p.MON25963, relatively 
large inverted repeats of polylinker DNA are formed. These inverted repeats are unstable in 
Escherichia coli, and piasraids harboring them do not replicate efficiently. Thus, diminishing the 
size of the polylinker in the shuttle vector can increase the probability of recovering stable 
recombinants. 

Another strategy for generating muitigene vectors and reducing the levels of background 
caused by vector re-ligation is shown in f igure 31. This strategy could be adapted to 
accommodate any number of enzymes, depending on the availability of unique restriction sites. 
One can easily design such a polylinker to accommodate one's cloning needs. As the vector 
becomes larger, one will want to have a larger homologous overlap for the ligation process or 
choose restriction eadonucJeases producing ends that are very easily ligated, and not self- 
compatible. By following the cloning procedure outlined in Figure 31, one can also control the 
directionality of the clone. If directionality is not important than clones generated from the 
ligation into the "shuttle vector" in either orientation could be used. (A<~ C or A -> C). 

As with any muitigene vector strategy, the starting piasmid used for constructing the 
large muitigene plasmids should be taken into consideration. The common plant transformation 
piasmid pBfN19 (Frisch, D. et ah, Plant Mol Biol 27: 405-409, 1995) has a starting size of 
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1 1 ,777 bp. In contrast plasmid pMONJ 0098 (Figure 4) has a starting size of 843 1 bp. The major 
difference between the two plasmids is the loss of the trfA function which is encoded in trans in 
Agrobactermm strain ABL Providing the trfA function in tram allows replication only in the 
specific strains of Agrobactermm engineered to harbor irjA. ft has been shown by Figurski and 
Helinski {Proc. Natl. Acad. Sci U.S.A. 76: 1648-1652, 1979) that replication factors can function 
m trans. By providing the minimal origins of replication required for maintenance in both 
Escherichia coli and Agrobactermm the starting size of the initial plasmid can be reduced 
significantly. 

Other possibilities to reduce the size of the starting plasmid would be to delete mil since 
this sequence is required for conjugations! transfer only. If electroporatkm is used to introduce 
the plasmid into Agrobactermm, oriT is not an essential dement. Another possibility would be 
to use selection that is functional in plains, Agrobacierium, and Escherichia coli. This could be 
accomplished by embedding into the plant promoter for the selectable marker a suitable bacterial 
promoter sequence and a ribosome binding site in proper context with the start codon on the 
selectable marker. One could also place this selectable marker on the plasmid flanked by its own 
right and left border sequences. This may allow for the selectable marker to be integrated into 
the plant chromosome unlinked to the genes of interest and potentially removed from subsequent 
generations. Alternatively, plants could be co-transformed by taking the mukigene plasmid and 
cotransforming on a separate plasmid the selectable marker for plants. T his would eliminate the 
cloning of the selectable marker on the muiti gene plasmid. The selectable marker can be 
delivered by mixing two different Agrobactermm strains, one containing the muftigeoe plasmid 
and the other containing the selectable marker, or by using the same Agrobaeiermm strain but 
having different isolates containing either the mufti gene plasmid or the selectable marker, or by 
having the selectable marker coexisting in the same Agrobactermm ceil with the multigene 
vector, but on a separate plasmid with a compatible origin of replication. 

One can also envision reducing the size of the selectable marker being used by using a 
tram complementation strategy. For example, one could transform a plant with a portion of a 
Nptll gene thai expresses a partial protein, if the transformation plasmid carries the 
complementary portion of the Nptll protein, both fragments of the Nptll protein may interact to 
confer resistance to kanamyem. This is analogous to the a-complementation strategy used for 
creating functional p-galaetasidase (reviewed by Zabin. 1. Mol Cell Biockem. 49: 87-96. 1982). 
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An example of an optimal starting plasmid for engineering multiple genes in plants 
would contain only the minimal essential elements required for replication in Escherichia coli 
and in Agrobaclermm (having all other required functions encoded in trans) as well as a 
selection scheme that (1) reduces the need for redundancy in the selectable marker, and/or (2) 
reduces the size of the selectable marker, or (3) removes the necessity of having the plant 
selectable marker on the imilti gene plasmid. The promoter used for driving the gene of interest 
in the multi gene vector should consist of the minima! essential elements required for temporal 
and spatial expression. The termination and poiyadenyiation signals should also contain only 
those sequences required for essential function. 

EXAMPLE 11: l^lvd^hy dr o xybutvrate) pr oduction in oilseed leukoniasts of Smv. 

Using plants as factories is attractive for the production of biodegradable plastics since 
current fermentation technology used for the commercial production of poiyhydroxyalkanoates 
(PHA) is prohibitively expensive. The simplest FHA t poly-pMiydroxybutyrate (PHB), has 
previously been produced in leaves of Arabidopsis tkalkma (Nawrath, C, et al., Proa Nad. 
Acad ScL U.S.A., 91: 12760-12764, 1994). Brawca napus oilseed, however, may provide a 
better system for PHB production because acetyl-CoA, the substrate required in the first step of 
PHB biosynthesis, is prevalent during tatty acid biosynthesis. Three enzymatic activities are 
needed to synthesize the PHB polymer: a p'-ketofhtoiase, an acetoacetyi-CoA reductase and a 
PHB synthase. Genes from the bacterium Ralstonia eutropha encoding these enzymes were 
independently engineered behind the seed-specific LesquereMa fendhri oieate-12 hydroxylase 
promoter in a modular fashion. The gene cassettes were sequentially transferred into a single, 
multi-gene vector which was used to transform Brassica napus. PHB accumulated in 
leukoptests to levels as high as 7.7% of seed dry weight. Electron microscopy analyses indicate 
that leukopiasts from these plants are distorted, yet intact, and appear to expand in response to 
polymer accumulation, 

Polyhydroxyaikanoates (PHAs) comprise a class of biodegradable poiymers which offer 
an environmentally-sustainable alternative to petroleum based plasties (reviewed by Pokier. Y., 
eX&UBioteehnoh&v, 13: 142-150, 1995). The homopoiymer Polyt^hydroxybutyrate) (PHB), a 
particularly well studied PHA, is normally synthesized by various species of bacteria under 
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conditions where nutrients become limited. PHB is stored in granules which can later be 
mobilized to provide a carbon and energy resource for the bacteria. 

One of the best-studied pathways for PHB synthesis is derived from the bacterium 
Raktoma eutropha (Slater, S.C., el ah,/ Bacterial, 170: 4431-4436, 1988; Schubert, P., et ah, J. 
Bad., 170: 5837-47, 1988; Peoples, O.P., and Sinskey, AJ., J. Biol. Chem., 264; 15298-15303, 
1989; Peoples, O.P, and Sinskey, A.J., J. Biol. Chem,, 264: 15293-15297, 1989). The pathway 
requires three enzymes: a p-ketothiolase, an acetoacetyi-CoA reductase, and a PHB synthase 
(Figure 32). R. eutropha uses least two p-ketothioiases, PhhA and BktB (Slater, S.C., et ai, J. 
BacL, 180: 1979-1987, 1998), and both of these enzymes were used in this study. The 
acetoacetyi-CoA reductase and PHB synthase are designated PhbB and PhhC, respectively 
(Peoples, O.P., and Sinskey, A J., J. Biol Chem, 264: 15298-15303, 1989; Peoples, O.P., and 
Sinskey, A. J.. J. Biol. Chem., 264: 15293-15297, 1989). 

R. eutropha is fermented commercially for PHA production, but the process is not 
economically competitive with polymers derived from petroleum. Therefore, novel commercial 
efforts to produce PHAs focus on using plants as polymer factories. In mis respect, our 
laboratory is considering two model systems: production in leaves and production in seeds. 
Since acetyi-CoA is a central metabolite tor both PHB and fatty acid biosynthesis, and Brassica 
napus seeds are extremely efficient in oil production, the Brassica seeds seem an optimal 
environment in which to produce PHB (IIS. Patent No. 5,502.273). Production of PHB in 
Arahidopm thalima leaves has been achieved using R. eutropha enzymes (Poirier, Y„ et ah. 
Science, 256; 520-523. 1 993). and additional work showed that polymer accumulation up to 14% 
of plant dry weight was achieved when the PHB biosyothetie enzymes were targeted to the 
plastid (Nawrath, C s et ai, Proc Nat, Acad $cL t 91; 12760-12764, 1994). 

The work presen ted here demonstrates polymer production in the seeds of Brassica napus 
using a multi-gene vector approach. A significant advantage to using these multi-gene vectors is 
that the entire PHA pathway is introduced simultaneously, thereby obviating the need for 
elaborate crossing strategies and eliminating the problems associated with insertions! effects at 
multiple loci. Construction of these multi-gene vectors involved the generation of modular 
cassettes, each harboring an individual gene. The cassettes were then assembled, into a single 
vector expressing the entire PHB biosyothetie pathway (Figure 33). Each cassette consisted of 
the LesqaereUafendieri oieate 12 hydroxylase promoter (Broun, p, et a!,. Plant I, 13: 201-210. 
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1998), a ehioropiast transit peptide fused to the open reading frame of interest (MrB. />M,4, 
orphbQ, and the 3' termination region of the Pkiw! sativum rbcSE9 gene (Coruzzi, G., et 
at, EMBO.L, 3: 1671-1679, 1984). The LesqmreHa promoter contains 2,2 kh ofDNA upstream 
of the coding region for the oieate-12 hydroxylase gene. This promoter was chosen because it is 
expressed concurrently with the accumulation of storage lipid (Broun, P., et aL Plant J., 13:20.1- 
210,1998). 

Expression of the PHB pathway in B. napm was achieved using ^gro&acfer««w~mediated 
transformation, and glyphosate selection was used to identify transgenic events (Fry, J., et aL 
Plant Cell Rep., 6: 321-325, 1987). The T-DNA transferred into the plants from these 
experiments exceeded 16 kiiobases in size. The co-expression rate of genes from the multi-gene 
vectors in Brassiea seeds was high, with 87% of the glyphosate resistant plants also producing 
polymer. Polymer levels ranged from 0.02-7.7% for the transgenic plants carrying pM0N36814 
(R. eutropha phbA, phbB, phbC) and 0.02-6.3% for those carrying pMON36816 (if. eutropha 
bktB, phbB and phbC). The vast majority of plants producing polymer fail within the 0-3.0% 
polymer range (Table 8) and ail polymer-producing tines generated viable seed. 



Table 8. Polymer resujts from canola multigene vector transformations. 
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The B. napus line displaying 7.7% polymer was further analyzed by electron microscopy. 
Micrographs revealed that polymer accumulated within the plastid (Figure 34), and that 
essentially every plastid contained polymer. Polymer production in the plastids is seemingly 
well tolerated; the size of the plastid expands to accommodate polymer production (compare 
Figures 34A and 34B). Ibis phenomenon is similar to the size changes observed when 
amyktplasts accumulate starch, and suggests that plastids will change size to accommodate 
accumulation of any granular product. Thus, the signal initiati ng an increase in plastid volume is 
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not specifically linked to accumulation of normal metabolites; rather, the increase is probably 
initiated simply by physical pressure applied to the plastid membrane. 

These results demonstrate that PHA accumulation is possible in an oilseed system. 
Commercial oilseed PHA production will require approximately twice the amount of PHA 
accumulation achieved here, Moreover, commercial success will rely on the development of an 
integrated processing system to extract PHA, oil and meal from the seeds. We believe that 
increases in PHA accumulation can be obtained using alternative promoters that arc stronger and 
expressed for a longer duration during seed development Other concerns regarding the 
feasibility of PHA production in pianta largely revolve around the metabolic effects of PHA 
production in oilseeds. Specifically; analysis of the effect of PHA production on oil yield will be 
of particular interest, since both are derived from aeetyl-CoA and produced simultaneously. Any 
untoward effect of PHA production on oil yield or seed quality will impact negatively on the 
economic feasibility of using B. napus as a commercial system. 

Vector Co n PI m \ ransformation 

A single vector encoding the entire PHB biosynthetic pathway was used for 
A.grobacterium'modiated transformation of Brassica. This vector, pMON36834, encodes bktB, 
phbB, and phbC (Figure 33). Each gene of interest was fused to a chloroplast transit peptide 
(dp), so each protein is transported to the seed kukopiast. All enzymes were fused to the 
Arabidopsis RuBisCo small subunit la transit peptide that was previously used for PHB 
production (Nawrath, C, et al., Proc. Nat. Acad. Set, 91: 12760-12764, 1994) except PhbB was 
fused to the transit peptide from pea RuBisCo small subunit (Cashmore, A.R., eds. Kosuge, T„ 
Meredith CP.. Holiaender, A., (Plenum, New York), 29-38, 1983). Each gene is controlled by 
the promoter from the fatty acid hydroxylase gene of Lesqwrella (P-Lh; Broun, P., et al., Plant 
J., 13: 201-210. 1998), and terminated with the E9 3'' region of the Pimm rbcSP.9 gene (Coruzzi, 
G., et aL EMBO J., 3: 1671-1679, 1984). P-Lh directs expression of these genes within the 
developing seed. The selection cassette for pMON36M2 and 36814 consisted of the Fig won 
Mosaic Virus promoter followed by the Petunia RuBisCo small subunit la transit peptide, the 
Petunia EPS? synthase genc<CP4) and nopaline synthase V termioation/'polyadenylation region 
(nos3*). 
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Transformation of Brassica napm was done as described in Fry, J. et al. (Ptotf CV// /tep., 
6; 321-325, 1 987) using giyphosate for selection. 

Polymer Analysis 

For isolation of polymer from caaoia seed, seeds were ground to a line powder with a 
mortar and pestle. Approximately 200 mg of each sample were extracted two times in a glass 
tube with 10 mL each of hexane for 1 hour at 60%\ then two times with 10 mL each of 100% 
methanol for one hour at 6G*C. This procedure removes oil from the seed. The material was 
allowed to dry completely overnight. Polymer was extracted from the dried material with 1 ml. 
of chloroform containing 3 jtmol/mL methyl-benzoate standard. The tube was heated to 1G0°C 
for 5 hours and the samples were cooled. One mL methanoi/suiphuric acid (85:15, v/v) was 
added, and the mixture was heated to 100°C for exactly 2,5 hours. The solution was cooled, 
extracted with water and subjected Jo gas chromatography. Gas chromatographic 
characterization of the methyl-ester residues was performed as described by (Slater, S., et aL, J, 
Bact., 180: 1979-1987, 1998) except that the temperature gradient was performed as follows: the 
initial temperature of 70°C was held for 6 minutes, then the temperature was increased by 30°C 
per minute to BOX. Finally, the temperature was increased by 50°C per minute to 3()0 C C and 
held at 300*C for 5 minutes. 



Partial imbibition of Brassica seeds was achieved by the slight abrasion of the seed coats, 
followed by placement for 2 hours onto filter paper moistened with distilled water. The 
cotyledons of these seeds were then cut into 1 mm 3 pieces and fixed in 4% gmtaraldehyde in 0.1 
M sodium cacodykte buffer, pH 7.2 for three hours, with the first 30 minutes under vacuum. 
The tissue was post-fixed in 1% osmium tetroxide in the above buffer, dehydrated in ethanoS and 
propylene oxide and infiltrated with a 1:1 mixture of Spurr's: EMbed 812 resin. The resin was 
polymerized at 60*C for 48 hours. The resulting blocks were sectioned on an Leiea Ultracut E 
microtome. Sections 80 run thick were picked up on foraivar/carbon coated copper slot grids. 
The grids were post-stained with uram-1 acetate and lead citrate in an 1KB altrastainer and 
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examined with a JEOL 1200 transmission electron microscope. (AH reagents were obtained 
from Electron Microscopy Sciences, Fort Washington, PA). 

SMMEIJL&i^^ and Brasxica. for ^M^y^^^^z. 



Poly(hydroxya!kanoates) are natural polymers with thermoplastic properties. One 
polymer of this class, poiy(p-hydroxybutyrate-co-|3-hydroxyvalerate) (PHBV) is currently 
produced by bacterial fermentation, but the process is not economically competitive with 
polymer production from petrochemicals. PHA production in green plants promises much lower 
costs, but producing polymer with the appropriate monomer composition is problematic. By- 
redirecting metabolic pools of both short-chain fatty acids and amino acids, Ambidapm and 
Brassiea have now been engineered to produce PHBV, a copolymer with commercial 
applicability, in this Example, polymer production, metabolic intermediate analyses, and 
pathway dynamics for PHBV synthesis in plmta are described. 

Pcuy(hydroxyalkanoates) (PHAs) are a class of polymers accumulated by numerous 
bacterial species as carbon and energy reserves. These polymers have thermoplastic properties, 
and have received much attention as biodegradable alternatives to petrochemical plastics 
(Anderson, A. J., and Dawes, E. A. Microbiol, Rev. 54; 450-472, 1990). While the 
homopolymer polyip-hydroxybutymie) (PHB) is somewhat brittle, many copolymers such as 
poly(p-hydroxybutyrate-co-p-hydroxyvalerate) (PHBV) are more flexible due to reduced 
erystaflimty, and suitable for many commercial applications. 

The biochemical pathways for PHB and PHBV production are essentially identical, 
differing only in the initial metabolites. PHB synthesis is initiated by condensation of two 
acetyl-CoA molecules, whereas PHBV synthesis requires the additional condensation of acetyl- 
CoA with propionyi-CoA. Following condensation, the products are reduced by a D-isomer 
specific acetoacetybCoA reductase, and the resulting p-hydroxy products are polymerized by 
PHB synthase (Anderson, A. 1, and Dawes, E. A. Microbiol Rev. 54: 450-472, 1990; 
Steiabiichel and Schlegel Mol Microbiol, 5(3):535-42, 1991). 

PHBV is produced commercially by growing Ratstonia eutropha on glucose and 
propionate (Byrum, D. FEMS Microbiol Rev. 102: 247-250, 1992), but the cost of this process 
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prohibits large-scale fomentation. Production of" PHAs via genetic engineering of green plants 
is expected to reduce costs to economical levels (van der Leij, P. R„ and Witholt, B. Can. J. 
Microbiol 41(SuppU): 222-238, 1995), and production of PHB homopolymer in plants has 
been demonstrated (Pokier, Y., et al. Science 256: 520-523, 1992; Nawrath, C; et al. Proc. Nail. 
Acad Set. 91: 12760-12764, 1994). However copolymer production has been problematic, 
primarily due to the requirement for metabolic precursors other than acetyl-CoA, 

Here we report metabolic engineering of plants to produce PHBV copolymer. By 
expressing four distinct transgenes and diverting metabolic pools of acetyi-CoA and threonine, 
copolymer was produced in Arabidopsis thaliana, and in the seeds of Brasska napm (oilseed 
rape). PHBV copolymer production opens the use of green plants as factories for commercial, 
environmentally-sustainable production of biodegradable plastics, 

R esults: A Pathway for PoM&-hvdroxvbutvmte-<x)-8-h v<iroxvvalerate^ Prod uction in Plants. 

A pathway designed to engineer PHBV production in the plastids of plants h 
diagrammed in Figure 35. Aceiyl-CoA is drawn from piastid intermediary metabolism, whereas 
propionyi-CoA is generated from threonine via 2-ketobutyrate (Gruys et al WO 98/00557; 
Eschenlauer, A.C., et at Int. J. Biol. Macromol. 19: 121-130, 1996) This pathway requires 
transformation of the plant with four separate genes: ilvA. bktB, phbB, and phbC. it also relies 
on the endogenous piastid pyruvate dehydrogenase complex (PDC). The threonine deaminase 
used in these studies is the biosyafhetic enzyme IlvA from E. mli (Taiilon, B.E., et al. Gene 63: 
245-252, 1988). The acetoacetyl-CoA reductase (PhbB) and PHB synthase (PhbC; are the same 
K. eutropha enzymes used in earlier in pianta studies (Pokier, Y., et. al. Science 256: 520-523, 
1992; Nawrath, C.i et al. Proc. Natl. Acad Sci. 91: 12760-12764, 1994). The ^keiothiokse is 
BktB from R. eutropha (Slater, S., et al. J. Bacterial 180: 1979-1987, 1998). Previous work on 
PHB production in plants used the if. eutropha PhbA p-ketothioiase. However, PbbA cannot 
efficiently synthesize p-keiovakryi-CoA, whereas BktB produces both p-ketovaleryl-CoA and 
acetoacetyl-CoA. 
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Polymer production was studied in both Arabidopsis thaliana leaves and Brassica napus 
seeds. For PHBV production in Arabidopsis, two separate vectors were constructed. Plasmid 
pMON256?8 encodes phbB and phbC, and plasmid pMON25812 encodes bkiB and ilvA. 
Transgenic Arabidopsis were generated by simultaneous Agrobacterium-mediated 
transformation with both vectors, and subsequent selection on both giyphosate and kanamyein. 
All genes were controlled by the e35S promoter (Odeil, J.T., et ai. Nature 313: 810-812, 5985), 
leading to polymer production throughout the plant, in Brassica, all four genes in the transgenic 
pathway were expressed from a single vector, pMON36824, and polymer production was 
directed to the seeds by the Lesqueretla hydroxylase promoter (Broun, P„ et ai. Plant J. 13; 201- 
210, 1998). 

Previous work on PHA production in plants has shown that polymer is produced 
efficiently and that phenotypic effects on the plant are minimized when PHA production occurs 
in the chloropiasts (Nawrath, C. et al. Proc. Natl. Acad. ScL 91: 12760-12764, 1994). The 
plastids are the site for synthesis of both oil, which is derived from acetyl-CoA, and threonine 
which is used to produce propionyl-CoA. In both Arabidopsis and Brassica, the PHA 
biosynthesis enzymes were targeted to the plastids using chioroplast transit peptides. In 
photosynthetic tissues of Arabidopsis the proteins are targeted to the chloropiasts, whereas in 
Brassica seeds the enzymes are targeted to the leucopiasts. 

Generate ooi Fropionvl-i :pA t rot i [hreoi int 

Conversion of threonine to 2-ketobutyrate by IlvA is the first reaction catalyzed by one of 
the recombmantly-encoded enzymes. HvA normally catalyzes the initial step in the conversion 
of threonine to isoleucine, and the enzyme is feedback-inhibited by isoleucine (Umbarger, H.E, 
Biosynthesis of branched-chain amino acids, pp. 442-457 in Escherichia colt and Salmonella: 
Cellular and Molecular Biology, Neidhart, F.C, Curtiss, R., Lin, E.C.C., Low, K.B., Magasanik, 
B., Reznikoff W.S., Riley, M„ Schaeehier, M., and Umbarger. H.E. (eds,}.ASM Press, 
Washington, D. C, 1996). However, ilvA mutants with diminished sensitivity to isoleucine 
have been described and two such mutants, ih>A466 (Pledger, W.J.. and Umbarger, H.E. J. 



W0 60/52183 



PCIYUSOfl/05931 



-43- 

BacmioL IM: 183-194, 1973; TaiUon, B.E., et al. Gene 63: 245-252, 1988) and ilv.A2.l9 (Bonis, 
R.O., etai. J. Biol. Chern. 254: 1074-1079, 1979; Eisenstein, B., et al. Biochemistry 34: 9403- 
9412, 1995), were used along with wild-type UvA in these studies. «vA466 is partially sensitive 
to feedback inhibition by isoleucine, and TlvA219 is essentially insensitive (Pledger, W.J., and 
Umbarger, H.E. J. Bacterial 114: 195-207, 1973; LaRossa, R.A., et al. </. Bacterial. 169: 1372- 
1378, 1987). 

Both Arabidopste and Brassica were initially transformed with separate vectors 
expressing wild-type i/wl, fttAtftf, and *te«/P. In both organisms, no fertile transfonnanis 
expressing HvA219 were recovered, indicating that expression of completely isoleudne- 
insensitive UvA is lethal. In Arabidopsis, plants expressing UvA466 were recovered at a very 
low frequency, whereas Brassica tolerated UvA 466 rather well. This result may be due to the 
seed-specific nature of the .lesquereUa promoter. Transforawnts expressing wild-ty pe UvA were 
efficiently recovered in both Arabidopste and Brassica, 

in order to monitor the metabolic effects of IlvA in transgenic plants, metabolites likely 
to be effected by this enzyme were analyzed. Figure 36 shows profiles of selected 2-ketoacids 
and amino acids in a control plant, and in transgenic Arabidopsis expressing wild-type UvA. As 
expected, the transgenic plant had elevated levels of both 2-ketobutyrate and isoleucine. In 
addition, a high concentration of 2-aminobutyrate was present. Formation of 2-aminobwyrate 
from 2-ketohutyrate is a freely-reversible reaction, probably catalyzed by the same branched- 
chain amino acid transaminase that catalyzes the final step in isoleucine biosynthesis (Singh, 
B.K. (1999) Biosynthesis of Valine, Leucine and Isoleucine. In: Singh, B.K, (ed.) Plant Amino 
Acids: Biochemistry and Biotechnology. Marcel Dekker, Inc., New York, pp.227-247, 1998). 
Although transgenic plants expressing UvA contained more 2-fcetobutyrate than did wild-type 
plants, the 2-ketobutyrate concentration was still below that of pyruvate. Most 2-ketobutyrate 
was apparently diverted to produce 2-aminobiityrate and isoleucine. The concentration of free 
threonine in a plant expressing UvA decreased by only about 15%, suggesting that threonine 
synthesis was sufficiently robust to compensate for the diversion of threonine through 2- 
ketobutyraie. Similar analyses were performed on the seeds from control and transgenic 
Brassica, and essentially the same results were obtained. In plants expressing UvA, isoleucine. 
2-ketobutyrate, and 2-taninobutyrate concentrations were elevated, and free threonine was only 
marginally decreased (K. GmysetaL unpublished data). 
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The second step in the formation of propionyl-CoA is catalyzed by the plastid pyruvate 
dehydrogenase complex, which is the sole endogenous enzyme required for PHBV production. 
This enzyme complex normally plays a central role in metabolism by converting pyruvate to 
aeetyi-CoA. We found thai PDC from isolated Brassica leukoplasts was also capable of 
converting 2-ketobutyrate to propionyi-CoA. However, PDC was approximately 10-fold less 
efficient when utilizing 2-ketobutyrate than when utilizing pyruvate; the specific activities were 
0,4 imits/mg and 3.6 units/tng for 2-ketobutyrate and pyruvate, respectively. 

Synthesis of PHBV Cop oivmar. 

Once propionyl-CoA has been produced, the pathway is identical to that shown to 
produce PHBV copolymer in recombinant E. colt (Slater, S., et al ,/. Bacterial. .180: 1979-1987, 
1998). Propionyl-CoA is converted to D-pdiydroxyvaleryl-CoA by BktB and PhbB, and then is 
polymerized with D-p-hydroxybutyryl-CoA to .form PHBV copolymer. The functionali ty of the 
entire pathway in plants is shown in Figure 37, which shows f H-NMR spectra demonstrating the 
presence of PHBV copolymer in Arabidopsh, We also obtained !3 C-NMR demonstrating PHBV 
copolymer production in Brassica, arid all the.se data have been corroborated by coupled gas 
chromatography-mass spectrometry (data not shown). The molecular weight of PHBV isolated 
from Brassica seeds was approximately i x 10«, with a polydispersity index of 2.4. These 
parameters are suitable for commercial applications. 

Although copolymer was made in both Arahidopsis and Brassica, the 3-hydroxyvalerate 
component varied with the in vivo polymer concentration. The polymer composition in Brassica 
seeds distinctly showed a negative correlation between the 3 -hydroxy valerate content of foe 
polymer and total polymer production (Figure 38). Threonine deaminase activity also negatively 
correlated with 3-HV content (Figure 38), a somewhat surprising result considering the role of 
llvA in the production of 3-HV. However, we have consistently found that introduction of 
vectors encoding multiple genes leads to a general concerted expression of all encoded enzymes. 
Thus, elevated IlvA activity is consistent with elevated polymer production. 



Discussion 
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The use of green plants as industrial factories will often require significant changes in 
plant metabolism, so metabolic engineering of multi-step pathways will become an important 
technology in "green chemistry* 5 efforts. In this study, production of the PHA copolymer PHBV 
has been accomplished using a combination of endogenous and transgene- encoded enzymes, 
s The pathway consists of five separate enzymes, four being encoded as transgenes. In the case of 
Brasska, all four genes were successfully introduced on a single vector. 

Commercial application of this technology will rest on two primary metabolic issues: 1) 
can polymer be produced in planta to concentrations amenable to economical polymer 
extraction? and 2} as the polymer concentration increases, can the appropriate monomer 
composition be maintained? We expect that polymer concentrations in planta will need to reach 
at least 15% of dry weight for economical production to be feasible. PHB homopolymer 
concentrations near 15% have been reported (Nawrath, C. ei al. Proc. Natl Acad Set, 91: 12760- 
12764, 1994} and have also been achieved in our laboratory (data not shown). Thus, high-level 
PHB production appears technically attainable. 

Production of PHBV copolymer has been accomplished in this study, although all plants 
produced copolymer at levels below 3% of plant tissue dry weight. The next challenge is high- 
level production of copolymer, and the data in Figure 38 show that additional work is required to 
maintain the 3-hydroxyvalerate composition at high polymer concentrations. Specifically, as 
polymer production increased, the 3-hydroxyvalerate fraction of the polymer decreased, and 
increasing threonine deaminase expression did not effect this correlation. These data suggest a 
metabolic bottleneck in the provision of 3-hydroxyvalerate to PITA synthase. The BktR, PhbB, 
PhbC pathway efficiently synthesizes PHBV copolymer (Slater. S., et al. J. Bacteriol. 180; 3979- 
3987, 1998), and production of 2-ketobutyrate in planta is efficient, as estimated from the 
elevated levels of 2-ketobutyrate, 2-aminobuiyrate and isoleueine (.Figure 36). Thus, the 
metabolic bottleneck must exist at the conversion of 2-ketobutyrate to propionyl-CoA by the 
pyruvate dehydrogenase complex. As noted above, the PDC strongly prefers pyruvate as a 
substrate, and tins difference is compounded in vivo by the concentration ratio of pyruvate to 2- 
ketobutyrate (Figure 36). Pyruvate dehydrogenase apparently cannot effectively compete for 2- 
ketobutyrate so propionyl-CoA sytraiesis is limited.. 

Production of copolymer to high internal concentrations may require a supplementary 
route for conversion of 2-ketobutyrate to propionyi-CoA. There are several ways to bypass the 
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PDC or supplement its activity, but all will require additional transgenes. These routes include 
modifying the a-ketoacid dehydrogenase to more readily accept propionyl-CoA (inoue H, et al. ./ 
Bacterial. 179: 3956-3962, 1997; Grays et al WO 98/0055?), expression of an alternative 
enzyme complex capable of forming propionyl-CoA from 2-ketobutyrate (Kerseher, L. and 
Oesterhelt, D., Eur. J. Biochem. 116: 587-594, 1981), or co-expression of a propionyl-CoA 
dehydrogenase (Horswiii et ai; Mitsky et al, unpublished data) with a propionyi-CoA synthetase 
or CoA transferase (Grays et ai WO 98/00557: Valentin et ai manuscript in. preparation). Thus, 
a commercially viable transgenic plant producing PHA polymer from threonine may contain up 
to six separate transgenes. 

Synthesis of propionyl-CoA can also be achieved through other metabolic pathways, 
although none presents a straightforward alternative to the threonine derived pathway (Figure 
39). For instance, propkmyi-CoA may be generated from acetyi-CoA using a 5-step pathway, 
part of which is involved in propionyl-CoA degradation in plants (Goodwin, TAV. and Mercer, 
E.L Introduction to Plant Biochemistry. Seeond Edition. Pergamon Press, Oxford, 1985; 
Eisenreich, W.. et ai. Eur. J Biochem. 215: 619-632, 1993; Preifert, H„ and Steinbuchel, A. 1 
Bacterial 174: 6590-6599, 1992: Podkowinski, J., et al. Proe. Natl. Acad Sci. USA 93: 1870- 
1874, 1996; Sun, J„ et at. Plant Physiol. 115: 1371-1383. 1997; Horswiii A.R., and Escalante- 
Semerena J.C. J. Bacterid. 379: 928-940, 1997; Grays el ai, unpublished data). Conversion of 
acrylyl-CoA to propionyl-CoA is potentially problematic, but an appropriate enzyme may be 
available from Chrorofiexus aurantiacus (Eisenreich, W.„ et al. Eur I Biochem. 215: 619-632. 
1993). Propionyl-CoA can also be derived from succinyl-CoA using a pathway present in both 
Rhodococcus ruber and Nocardia corallim (Williams, D.R.,et al. Appl. Microbiol. BmtecknoL 
40: 717-723, 1994; Valentin, H.E., and Dennis, D. Appl. Environ. Microbiol. 62: 372-379, 1996). 
This pathway is initiated by methyl makmyl-CoA mutase, an enzyme that requires vitamin B 12 as 
a cefaclor. However, vitamin Bn is not synthesized in plants (Goodwin, T,W. and Mercer, E.I. 
Introduction to Plant Biochemistry. Second Edition. Pergarnon Press, Oxford, 1985). 
Rhodococcus and Nocardia also produce minor amounts of 3-hydroxyvaieryl-CoA via a 
different, uncharaeterized route. This route may be a link to amino acid metabolism, such as the 
pathways used by other bacteria and animals to degrade valine and tsoiencine (Figure 39). These 
pathways might also be engineered in plants, but a large number of genes are required. 
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Several other amino acids can be used to produce propionyl-CoA. Methionine, like 
threonine, generates 2-ketobutyrate during cataboiism. Tins conversion is catalyzed by L- 
methionine y-iyase in a reaction that also produces ammonia and methanethiol (Tanaka, H„ ei ai. 
Emyme Microb. TechnoL 7: 530-537, 1985), The effect of methanethiol production on plants is 
unknown, and supplementation of PDC activity would still be required to efficiently produce 
propionyl-CoA. Another pathway, present in Clostridium propianieum, converts alanine to 
propionyl-CoA via lactic acid, lactyl-CoA and acrylyl-CoA (Sehweiger, G., and Bucket W, 
FEES Lett. 571: 79-84, !984; Garden, B. P., and Barker, H. A. Arch. Biochem. Biophys. 12: 
165-180, 1947). However, none of the required genes has been cloned, and some of the 
necessary enzymes are oxygen sensitive (Hofmeister, A.E.M.. and Bucket W. .Eur. J, Biochem. 
206: 547-552, 1992: Kuehta, R.D., and Abeles, R.H. J. Biol. Chem. 260: 13181-13189, 1985). 
Manine is another potential starting metabolite for the production of propionyl-CoA (Arst, 
RN. Jr. Mai. Gen. Genet. 163: 23-27, 1978; Roberts, E„ and Bregotl HM. J. Biol Chem. 201: 
393-398, 1953; Kupiecki, R.P., and Coon, M.J. J. Biol. Chem, 229: 743-754, 1957). ^-alanine 
normally plays a critical role as a precursor to Coenzyme-A and acyl carrier protein. However, 
little is known about the concentration and eompartmentali nation of Maame in plants, and 
propionyl-CoA may actually be required for its synthesis. 

In summary, poly(p-hydi»xybwymt^co-p-hydroxyvaJerate) copolymer was produced in 
both Ambidopsis and Brassiest by simultaneously accessing amino acid and short-chair! fatty- 
acid metabolite pools. In Brassica, all four required tmnsgenes were introduced on a single 
vector, eliminating the plant crossing normally necessary to assemble a pathway of this size. 
The polymer molecular mass was adequate for commercial purposes, but an apparent metabolic 
bottleneck in conversion of 2-ketobutyrate to propionyKx-A suggests that additional engineering 
may be required to achieve high-level production of polymer with the necessary (3- 
hydroxyvaierate composition. 

Gene ration ofilvA mutants. 

All UvA alleles used herein are derived from the £. coii ilvA eene (Lawther, R.P. et ai 
Nucl Acids Res. 11: 2137-2155, 1987) that is harbored in pMON25659 (Grays et a] WO 
98/00557), The UvA2l9 mutation (Eisenstein, E., et al. Biochemistry. 34: 9403-9412, 1995) and 
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ilvA466 mutation (Taiilon, B. E., et ai. GW?t>. 63: 245-252, 1988X both originally isolated in 
Salmonella typhi murium, were introduced into the E. coli gene by oiigonuekoude-directed 
mutagenesis as previously described (Grays et a!. WO 98/00557). 

Pja^mkl ( unsmjcunn andj xansformannn if Arabufai _ c / 

AH transformation vectors are derived from pMON10098, a vector designed for 
.4gro6ac/e«am-mediated transformation of plants that encodes the apt!.! selectable marker, The 
trfA function Is provided in trans by the host bacterium, Agrobacterium tumefaciens ABI. A. 
tmmfaciens ABI is Agrobacterium strain GV3I01 (Van Larebeke. N, et ai. Nature. 252: 169- 
170, 1974) harboring the helper plasmid pMP90RK (Koncz, C. and Scheil, J. Mel, Gen. Genet. 
204: 383-396, 1986). 

All PHA production genes used in this study were initially constructed in intermediate 
vectors as cassettes including a promoter, a chioropiasi transit peptide fused to the gene of 
interest, and a 3' control region. In every case, the gene cassette is flanked by Not 1 restriction 
sites, plus several additional unique restriction sites. Each cassette was excised from it's 
intermediate vector using appropriate restriction enzymes, and sequentially heated into the 
recombinant vector for plant transformation. 

For metabolite analysis, Arabidopsis was transformed with either pMON 15715. an ifoA~ 
negative control vector, or pMON25668, which expresses both phbA and wild-type ilvA from 
e35S promoters. 

for production of PHBV in Arabhlopslu two separate piasmids were used. 

The first vector encoded both phbB and phbC <pMON25678), and the second vector 
encoded both bktB and ilvA (pMON25812). All genes were controlled by the e35S promoter 
(Odell, I. T., et ai. Nature. 313: 810-812, 1995) and the E9 V region (Coruzzt G„ et al. EMBO 
J. 3: 3675-1679, 1984). All enzymes were fused to the Arabidopsis RuBisCo small subunit la 
transit peptide that was previously used for PHB production (Nawrath, C, et al. Proc. Nad. 
Acad ScL 91: 12760-12764, 1994). Piasmid pMON25678 encodes resistance to glyphosate, 
whereas pMON25812 encodes resistance to kanamycm. Both piasmids were simultaneously 
used for Agrobactermm-medimed Arabidopsis transformation (Beehtold N„ et al. Camples 



WO 00/52183 



PCT/lfSOO/05931 



-49- 

Rendus Acad Set. Paris Sciences Serie III Sciences de la Vie, 316: 1194-1199, 1993), and 
transformaras were selected on both glyphosate and kanaraycin as follows. 

Arabidopsis thediaaa Columbia plants were grown in Metro Mix 200 in 2.5 in. pots 
covered with a mesh screen. Sown seed was vernalized for 5 days and germinated under 
conditions of 16 hours light /8 hours dark at 20*C to 22°C, 75% humidity. Plants were watered 
and fertilized twice weekly with ',4X Peters 20-20-20 until infiltration, 

A 1:50 dilution of an overnight culture of Agmbacterium tumefaciem ABI strain was 
grown at 28°C in YEP containing Spectinomycin 100 mg/L, Streptomycin, 100 mg/L, 
Chloramphenicol 25 mg/L, and Kanamycin 50 mg/L. Each culture, contained a different ABI 
construct. After 16-20 hours the Agrobwlerium cultures were concentrated by centrifugation. 
The supernatant was discarded and the cell pellets were dried and resuspended in infiltration 
medium (MS Basal Salts 0.5%, Gamborg's B-5 Vitamins 1%, Sucrose 5%, MES 0.5 g/L, pH 
5.7) with 0.44 nM benzylaniinopurme (.1 0 pL of a 1 .0 mg/L stock in DMSO per liter) and 0.02% 
Silwet L-77 to an OX> m of 0.8. For co-infiltrations each culture was resuspended as described 
above and 150 mL each of two cultures were combined for a total of 300 mL. 

Plants were soaked in water 30 minutes prior to infiltration, inverted plants were placed 
into the cultures and vacuum infiltrated at 27 in. Hg for 10 minutes. The plants were placed on 
their sides in a diaper-lined tray and covered with a germination dome for one day. The pots 
were then turned upright and were not watered for five days. Infiltrated plants were grown to 
maturity as described above. Ripe seeds were harvested and sterilized. Harvested seed was 
placed in a 15 mL Coming tube and sterilized. The tubes containing seed were placed on their 
sides with lids loosened in a vacuum dessicator containing a beaker of Clorox and 1:100 
hydrochloric acid. The dessicator was then sealed with a vacuum and the seed remained in the 
dessicator overnight. Sterilized seeds from co-inftitrated plants were placed on media containing 
MS Basal Salts 4.3 g/L, Gamborg's B-5 (500 X) 2.0 g/L, glucose 10 g/L, MES 0.5 g/L, and 8 
g/L phytagar with carbenieiiKn 250 mg/L, cefotaxime 100 mg/L, kanamycin 60 mg/L and 4 roM 
glyphosate. The seed was germinated at 26°C, 20 hours light / 4 hours dark. Transforms 
were transferred to soil and covered with a germination dome for one week. The plants were 
grown in plant growth conditions described above. 

For transformation of Brassica napus, a single vector encoding the entire PHBV 
biosynthesis pathway was used. This vector, pMON36824, encodes bktB, phbB, phbC, and 
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ilvA466 (Figure 3). As with the Ambidopsis vectors, each gene of interest was fused to a 
ehloropiast transit peptide, so each protein is transported to the seed leukoplasi. All enzymes 
were fused to the Ambidopsis RuBisCo small subunit la transit peptide that was previously used 
for PHB production (Nawrath, C, et ai. /Voa Afetf. /kW. &l 91: 12760-1 2764, 1994), except 
PhbB was fused to the transit peptide from pea RuBisCo small subunit (Cashroore, A.R. Nuclear 
genes encoding the small subunit of rihulose-1 ,5-hisphosphate carboxylase, pp. 29-38 in Genetic 
Engineering of Plants, Ko.suge, T., Meredith, CP., Holiaender, A. (eds). Plenum, New York, 
1983). Each gene is controlled by the promoter from the fatty acid hydroxylase gene of 
Lesqmrella (P-Lh: Broun, P.. et a!. Plant J. 13: 201-210, 1998,, and the E9 3* region (Corasri, 
G„ et al. EMBO J. 3: 1671-1679, 1984). P-Lh directs expression of these genes within the 
developing seed. Transformation of Brassica was performed as described by Fry et ah {Plant 
Cell Rep. 6: 321-325, 1987), and transformants were selected on glyphosate. 

ISSMmMMmiSfLm^ feukopiasts an d analyst s of pyruvate dehydrogenase comp ly ^riviry 

LeukopJasts were isolated essentially as described by Kang and Rawsthamc (Plant J, 6: 
795-805, 1994). Isolated ieucopiasts were lysed by somcation and debris removed by 
centrifugatior, at 10,000 x g for 10 minutes. The crude extract was desalted using Pharmacia 
NAP-5 columns and the protein concentrations determined by the Bradford method (Bradford. 
ML Anal Bhchem. 72: 248-254, 1976). Five to 50 uL were added to assay mix which contained 
final concentrations of: 100 mM EPPS, pH 8.0; 5 mM MgCfe; 2.4 mM coenzyme- A; 1.5 mM 
NAD ; and 0.2 mM TPP (coearboxyiase). The reaction was initiated with addition of either 
pyruvate or 2-ketobutyrate substrates to final concentrations of 1.5 mM and 30 mM, respectively. 
To aid in analysis and ensure peak identities, U C labeled pyruvate and 2-ketobutyrate were 
spiked into both substrates. The reactions were quenched with 30 pi, of 10% formic acid after 2 
to 30 minutes. 100 pJL of the reaction was injected onto a Beekman Uhrasphere HPLC column (5 
uM, 4.6 mm x 15 cm) and elated with 1 ml/minute gradient of solvent A (50 mM ammonium 
acetate buffer pH 6.0 containing 5% aeetonitrile) going from 0 to 40 % solvent 8 (acetonitrile) in 
15 minutes. The reaction was followed by monitoring absorbance of CoA-derived products at 
230 and 260 nm using a phofodiode army detector. Use of radioisotope flow detector allowed 
confirmation of both substrate and product peak, identities. The percent conversion of added 
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substrates was used to determine the specific activities of the extracts. One unit equals one nmo! 
product produced per minute per mg protein in extract. 

< ViTij no j ci d a; )d 2- ketoaci d ,U:.iK.--:> 

Amino Acid analysis was performed by Dr. Donald Willis at Ralston Analytical 
5 Laboratories, essentially as described by Willis (J. Chromatog. 408: 217-225, 1987). 

Extraction and Gas Chro matography Analysis of Po lymer from Arabidopsis 

For isolation of polymer from Arabidopsis, stems and leaves were harvested and 
dehydrated by lyophilization for approximately 36 hours. The material was ground to a One 
powder, and 100 mg of powder was treated with 10 mL Cforox bleach for 1 boor with shaking at 
fo room temperature. The extract was subjected to centrifugation at 1 ,600 x g for ten minutes, and 
the supernatant solutions was carefully removed. Ten mL 100% methanol were added, the 
solution was mixed by vortex, and then centrifuged again. After a second, identical methanol 
extraction, the material was allowed to dry overnight. Polymer was extracted from the dried 
material with 1 ml. of chloroform containing 3 umoi/mL metirybbenzoate standard and 1 mL of 

a tnethanol/sulphurlc add (85:15. v/v). The tube was heated to 100°C for exactly 2.5 hours, and 
the solid material was removed by centrifugation. The solution was cooled, 1 mL water was 
added, and the liquid was mixed using a vortex mixer. The organic and aqueous phases were 
separated by centrifugation at 1,600 x g for ten minutes. The chloroform layer was transferred to 
a clean test tube and vigorously mixed with approximately 200 mg of silica gel. Solid material 

29 was removed by centrifugation, and the supernatant material was subjected to gas 
chromatography. Gas chromatographic characterization of the methyl-ester residues was 
performed as described by Slater et ah (J. Bacterial 380: 1979-1987, 1998), except that the 
temperature gradient was performed as follows. The initial temperature of 70°C was held for 6 
minutes, then the temperature was increased by 30°C per minute to I30°C. Finally, the 

15 temperature was increased by 50°C per minute to 300°C and held at 300°C for 5 minutes. 

Extraction an r nuaranh v Analysis of Polymer from Brasska seeds 
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For isolation of polymer from canoia seed, seeds were ground to a fine powder with a 
mortar and pestle. Approximately 200 mg of each sample were extracted two times in a glass 
tube wiiii 10 mL each of hexane for 1 hour at 60°C, then two times with 10 mL each of 100% 
methanol for one hour at 60°C. This procedure removes oil from the seed. The material was 
allowed to dry to completion overnight. Polymer was extracted from the dried material with 1 
ml. of chloroform containing 3 umol/mL methyl-benzoate standard. The tube was heated to 
100°C for 5 hours and the samples were cooled. One mL methanoJ/sulphuric acid (85:15, v/v) 
was added, and the mixture was heated to 1 00*C for exactly 2.5 hours. The solution was cooled, 
extracted with water and subjected to gas chromatography as described above. 

Characterization of poivmer by nuclear magnetic n . nectroseo pv and gel permeation 
chroma tography 

Nuclear magnetic resonance (NMR) studies were done using a Varian Unity 500 MHz 
spectrometer. Proton spectra were obtained on a Varian pfg 5 mm probe at 30*C from PHA 
samples of approximately 20 mg dissolved in 1 mL deuierochloroform. Acquisitions were taken 
at a 90" pulse, 2.3 g acquisition time, 30 s delay, collecting 65k data points and 16 
accumulations. Chemical shifts were referenced to CHCH (5-7.24 ppm). The 13C{IH) spectra 
(125 MHz) were taken at 30°C on a Nalorac 3 mm U C probe containing a solution of 
approximately 10 mg PHA in 200 uL deuterochloroform. The spectra were obtained using 30° 
pulses, 1.5 s acquisition time, zero delay, 131k data points and 55,296 accumulations. Chemical 
shifts were measured relative to CHCh (6=77.0 ppm). 

Gel permeation chromatography was performed according to Koizumi et al. (J. Al S. 
PureAppl. Chem. A32: 759-774, 1995). 

EXAMPLE 13: Plant Promoters 



Plant promoter sequences can be constitutive or inducible, environmental ly- or 
developmental iy-reguiated, or cell- or tissue-specific. Often-used constitutive promoters include 
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the CaMV 35S promoter (OdeJI et al., Nature 313: 810-812, 1985), the enhanced CaMV 35$ 
promoter, the Figwort Mosaic Virus (FMV) promoter (Rkhins, R.D. et al. Nucleic Acids Res. 
20: 8451-8466, 1987), the mannopme synthase (mas) promoter, the nopaline synthase (wo,*) 
promoter, and the octopine synthase (oes) promoter. Useful inducible promoters include 
promoters induced by salicylic acid or polyacryik acids (FR-L Williams , S. W. et al. 
Biotechnology 10; 540-543, 1992), induced by application of safeners (substituted 
benzenesulfonamide herbicides, Hershey, H.P. and Stoner, T.D.. Plant Moi Biol. 17: 679-690, 
1991), heat-shock promoters (Ou-Lee et al., Proc. Natl. Acad. Set. US. A. 83: 6815-6819, 3986; 
Amiey, W.M. et al., Plant Moi Biol 14: 949-967, 1990), a nitrate- inducible promoter derived 
from the spinach nitrite reductase gene (Back, E. et ah, Plant Moi. Biol 17: 9-18, 1991), 
harmone-mducible promoters (Yamaguchi-Shinozaki. K. et al., Plant Moi Biol. 15: 905-912, 
1990: Kares et al.. Plant Moi Biol. 15: 905-912, 1990). and iight-inducibk promoters associated 
with the small subunit of RuBP carboxylase and LHCP gene families (Kuhlemeier et al. Plant 
Cell 1: 471-478, 1989; Fembaum, RX. etaL Moi. Gen. Genet. 226: 449-456, 1991; Weisshaar, 
B. et al., EMBO J. 10: 1777-1786, 1991: Lam, E. and Chua, N.H., J Biol. Chem. 266: 17131- 
17135, 1990; Castresana, C. et al., EMBO J. 7: 1929-1936, 1988; Schuize-Lefert, P. et al., 
EMBO ..I 8: 651-656, 1989). Examples of useful tissue-specific, deveiopmentally-reguiated 
promoters include the p-eonglycinin 7S promoter (Doyle, J J. et ah, 7 Biol. Chem. 261: 9228- 
9238, 1986; Slighton and Beachy, Phmta 372: 356, 1987), ami seed-specific promoters 
(Knuteon. D.S. et al., Proc. Nail Acad. Set. U.S.A. 89: 2624-2628, 1992; Bustos. MM, et al., 
EMBO J. 10: 1469-1479, 1991; Lam, E. and Chua, N.HL, Science 248: 471-474, 1991; Stayton et 
al., Aust. I Plant. Physiol. 18; 507, 1991). Plant functional promoters useful for preferential 
expression in seed plastids include those from plant storage protein genes and from genes 
involved j n fatty acid biosynthesis in oilseeds. Examples of such promoters include the 5* 
regulatory regions from such genes as napin (Kridl et al.. Seed Sci. Res. 1: 209-219. 1991). 
phaseolin, zein, soybean trypsin inhibitor, ACP, stearoyl-ACP desaturase. and oleosin. Seed- 
specific gene regulation is discussed in EP 0 255 378, Promoter hybrids can also be constructed 
to enhance transcriptional activity (CornaL L. and Moran, P.M., U.S. Patent No. 5,106,739, 
issued April 21, 1992), or to combine desired transcriptional activity and tissue specificity. 
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1 - - ! 1 1 ' 1 T «? i'loii and reg eneration 

A variety of different methods can be employed to introduce such vectors into plant 
protoplasts, cells, callus tissue, leaf discs, meristeras, etcetera, to generate transgenic plants, 
including Agrobacteriutn-me&at&d transformation, particle gun delivery, microinjection, 
electronoration, polyethylene glycol mediated protoplast transformation, uposome-mediated 
transformation, etc. (reviewed in Potrykus, Am. Rev. Plant Physiol. Plant Mol. Biol. 42: 205- 
225, 1991). In general, transgenic plants comprising ceils containing and expressing DNAs 
encoding enzymes facilitating PHA biosynthesis can be produced by transforming plant ceils 
with a DNA construct as described above via any of the foregoing methods; selecting plant cells 
thai have been transformed on a selective medium; regenerating plant ceils that have been 
transformed to produce differentiated plants; and selecting a transformed plant which expresses 
the enzyme-encoding nucleotide sequence. 

Specific methods for transforming a wide variety of dicots and obtaining transgenic 
plants are well documented in the literature (Gasser and Fraley, Science 244: 1293-1299, 1989; 
Fisk and Dandekar, Scientia Horticulturae 55: 5-36, 3993; Chriswu, Agra Food Industry Hi 
Tech, pj 7 (1994); and the references cited therein). 

Successful transformation and plant regeneration have been reported in the monocots as 
follows: asparagus (Asparagus officinalis; Bytebier ei a!., Proc. Natl Acad Sci. U.S.A. 84; 5345- 
5349, .1987); barley (Honteum vulgarae; Wan and Lemaux, Plant Physiol. 104: 37-48. 1994); 
maize (Zea mays; Rhodes, C.A. et ah. Science 240: 204-207, 1988; Gordon-Kantm et al. Plant 
Ceil 2: 603-618, 1990; Fromm, M.E. et aL Bio/Technology 8: 833-839, 1990; Koziel et at, 
Bio/Technology 11: 194-200, 1993); oats (Avem saliva; Somerset al., Bio/Technology 10: 1589- 
1594, 1992); orchardgrass ( [Dactyl 'is ghmerata; Horn et al, Plant Cell Rep. 7: 469-472, 198$;,- 
nee (Oryza saliva, including indica and japonica varieties; Toriyama et al., Bh/Technoiogy 6: 
10, 1988; Zhang et al., Plant Celt Rep. 7; 379-384, 1988; Urn and Wu, Plant Mol Biol Rep. 6: 
165, 1988; Zhang and Wu, Theor. Appl. Genet. 76: 835, 1988; Chmtou et al, Bio/Technology 9: 
957-962, 199.1): rye (Secale cereale; De ia Pena et aL, Nature 325; 274-276, 1987); sorghum 
(Sorghum bicotor, Casas, A.M. et al, Proa Natl. Acad. Sci. U.S.A. 90: 11212-11216, 1993); 
sugar cane (Saccharum spp.; Bower and Birch, Plant J. 2: 409-416, 1992); tail fescue (Festuca 
arundinacea; Wang, Z.Y. et al, Biatfechnohgy 10: 691-696, 1992); turfgrass (Agrmtis 
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palustrts; Zhong et al., CV// /Up. 13: 1-6, 1993}; wheat (ffi/icw aesttvum; Vasil et a!., 
Bio/Technology 10: 667-674, 1992; Weeks, T. et al., iVawf P/ntfio/. 102: 1077-1084, 1993; 
Becker et al., Plant J. 5: 299-307, 1994), and alfalfa (Masoud, S.A. et al., Tramgen. Res. 5: 313, 
1996). 

EXAMPLE I S: Host plants 

Particularly useful plants for poiyhydroxyaikanoate production include those that produce 
carbon substrates which can be employed for poiyhydroxyaikanoate biosynthesis, including 
tobacco, wheat potato, Arabidopsis, and high oil seed plants such as com, soybean, canola, oil 
seed rape, sunflower, flax, peanut, sugarcane, switchgrass, and alfalfa. 

If the host plant of choice does not produce the requisite fatty acid substrates in sufficient 
quantities, it can be modified, for example by mutagenesis or genetic transformation, to block or 
modulate the glycerol ester and tatty acid biosynthesis or degradation pathways so that it 
accumulates the appropriate substrates for polyhydraxyalkanoate production. Expression of 
enzymes such as acyl-ACP thioesierase, fatty aeyl hydroxylase, and yeast MFP may serve to 
increase the flux of substrates in the peroxisome, leading to higher levels of 
po ly hydroxy alkanoate biosynthesis. 

EXAMPLE 16:) 



Variations 



on protein may lead to mutanl 



protein sequences that display equivalent or superior enzymatic characteristics when compared 
to the sequences disclosed herein. "This invention accordingly encompasses nucleic acid 
sequences which are similar to the sequences disclosed herein, protein sequences which are 
similar to the sequences disclosed herein, and the nucleic acid sequences that encode them. 
Mutations may include deletions, insertions, truncations, substitutions, fusions, and the like. 

Mutations to a nucleic acid sequence may be introduced in either a specific or random 
manner, both of which are well known to those of skill in the art of molecular biology. A myriad 
of site-directed mutagenesis techniques exist, typically using oligonucleotides to introduce 
mutations at specific locations in a nucleic acid sequence. Examples include single strand rescue 
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(Kunkel, T. Proc. Natl. Acad. Set. US, A, 82: 488-492, 1985), unique site elimination (Deng and 
Nickloff, Ami. Btoekmi, 200: 81, 1992), nick protection (Vandeyar, et al. Gem 65: 129-133, 
1988), and PCR (Costa, et al. Methods Mol. Biol, 57: 31-44, 1996). Random or non-specific 
mutations may be generated by chemical agents (for a genera! review, see Singer and Kusmierek, 
Ann. Rev. Biochem. 52: 655-693, 1982) such as nitrosoguanidine (Cerda-Olmedo et al., ,/. Mol 
Biol. 33: 705-719, 3968; Guerola, et al. Nature New Biol. 230: 122-125, 1971) and 2- 
arainopurine (Regan and Bessman, J. Bacterial 103: 622-633, 1970), or by biological methods 
such as passage through mutator strains (Greener et al. Mol, BioteehmL 7: 1 89-195, 1 997). 

Nucleic acid hybridization is a technique well known to those of skill in the art of DNA 
manipulation. The hybridization properties of a given pair of nucleic acids is an indication of 
their similarity or identity. Mutated nucleic acid sequences may be selected for their similarity to 
the disclosed nucleic acid sequences on the basis of their hybridization to the disclosed 
sequences. Low stringency conditions may be used to select sequences with multiple mutations. 
One may wish to employ conditions such as about 0.15 M to about 0.9 M sodium chloride, at 
temperatures ranging from about 20°C to about 55°C. High stringency conditions may be used 
to select for nucleic acid sequences with higher degrees of identity to the disclosed sequences. 
Conditions employed may include about 0.02 M to about 0.15 M sodium chloride, about 0.5% to 
about 5% casein, about 0.02% SDS and/or about 0.3% N-lauryisarcosine. about 0.001 M to 
about 0.03 M sodium citrate, at temperatures between about 50°C and about 70°C. More 
preferably, high stringency conditions are 0.02 M sodium chloride, 0.5% casein, 0.02% SDS, 
0.001 M sodium citrate, at a temperature of 50°C. 

i^KAMBLF H: jjaermm ation ofhomologotis and dege nerate nucleic acid sequ ence s 

Modification and changes may he made in the sequence of the proteins of the present 
invention arid the nucleic acid segments which encode them and still obtain a functional 
molecule thai encodes a protein with desirable properties. The following is a discussion based 
upon changing the amino acid sequence of a protein to create an equivalent, or possibly an 
improved, second-generation molecule. The amino acid changes may be achieved by changing 
the codons of the nucleic acid sequence, according to thecodons given in Table 9, 



WO0Q/S21S3 



PCT/US0O/O59M 



-57- 



amino acids 



* % no acid 


* :;ticr 


l -f ^ rcr 


* do , 


FAIunine 


A 


Ala 


til v {,{ t on (,< 


ICysteme 


C 


Cys 


TGC IX VI 


rtk acid 


I> 


Asp 


GA( GAi 


G! > ci 


E 


Gin 


OAA GAG | 


; Phenylalanine 


F 


Phe 


f TC TTT 


Glycine 


0 


Gly 


GGA GGi; GGG GG'f 


tiistidine 


H 


His 


< V « Al 








»J C A FT 


; Lysine 






AAA A AO 


Leuca : 


L 


™ 


1 TTA 1 i G CTA CTC CTG CTT 


M ionine 


M 






I Asparagine 


N 


Asn 


I AAC AAT 


feodine 


P 


Pro 


cca ccc ccc t car 


|Giutamme 
I Arginme 


Q 


Gift 


CAA CAG 1 


R 


Arg 


AGA AG( i 1 c GO CGT| 


| N ~i ine 
{Threonine 


S 
T 


Ser 


agc ag r iCA rcc rec 


fhr 


ACA ACC ACG ACT 


Valine ™ 


V 


Va! 


GTA GTC GTG GTT | 


i f rypt ->phan 


W I 


Tip 


IGG J 


[Tyrosine 


Y | 


Tvr 


I AC T Vl 



Certain amino acids may be substituted for other amino acids in a protein sequence 
without appreciable loss of enzymatic activity. It is thus contemplated that various changes may 
be made in the peptide sequences of the disclosed protein sequences, or their corresponding 
nucleic acid sequences without appreciable loss of the biological activity. 

In making such changes, the hydropathic index of ammo acids may be considered. The 
importance of the hydropathic amino acid index in conferring interactive biological function on a 
protein is generally understood in the art (Kyteand Doolittle, ./. Mai. Biol., 157: 105-132, 1982). 
it ts accepted that the relative hydropathic character of the amino acid contributes to the 
secondary structure of the resultant protein, which in turn defines the interaction of the protein 
with other molecules, for example, enzymes, substrates, receptors, DNA, antibodies, antigens, 
and the like. 

Each amino acid has been assigned a hydropathic index on the basis of its hydrophobic tv 
and charge characteristics. These are: isoleueine (+4.5); valine (+4.2); leucine (+3.8); 
phenylalanine (+2.8); cysteine/cystine (+2.5); methionine (+1.9); alanine (+1.8); glycine (-0.4); 
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threonine (-0.7); serine (-0.8); tryptophan (-0.9); tyrosine (-1,3); proline (-1.6); histidine (-3.2); 
glutainate/gluiamine/aspartate/aspar^ine (-3.5); lysine (-3.9); and argmine (-4,5). 

It is known in the art that certain amino acids may be substituted by other amino acids 
having a similar hydropathic index or score and still result in a protein with similar biological 
activity, i.e., still obtain a biologically functional protein. In snaking such changes, the 
substitution of amino acids whose hydropathic indices are within ±2 is preferred, those within ±1 
are more preferred, and those within ±0.5 are most preferred. 

It is also understood in the art that the substitution of like amino acids may be made 
effectively on the basis of hydrophilicity. U.S. Patent No. 4,554, JO! (Hopp, T.P„ issued 
November 19, 1985) states that the greatest local average hydrophilicity of a protein, as governed 
by the hydrophilicity of its adjacent amino acids, correlates with a biological property of the 
protein. The following hydrophilicity values have been assigned to amino acids: argimne/iysine 
(+3.0); aspartate/glutamate (+3.0 ±1); serine (+0.3); asparagme/glutatmne (+0.2); glycine (0); 
threonine (-0.4); proline (-0,5 ±1); alanine/histidtne (-0.5); cysteine (-1.0); methionine (-1.3); 
valine (-1.5); leucine/isoleucine (-1.8); tyrosine (-2.3); phenylalanine (-2.5); and tryptophan <- 
3.4). 

It is understood that an amino acid may be substituted by another amino acid having a 
similar hydrophilicity score and still result in a protein with similar biological activity, i.e., still 
obtain a biologically functional protein. la making such changes, the substitution of amino acids 
whose hydropathic indices are within ±2 is preferred, those within ±1 are more preferred, and 
those within +0.5 are most preferred. 

As outlined above, amino acid substitutions are therefore based on the relative similarity 
of the amino acid side-chain substituents, for example, their hydrophobic^', hydrophilicity, 
charge, size, and the like. Exemplary substitutions which take various of the foregoing 
characteristics into consideration are weJl known to those of skill in the art and include: arginme 
and lysine; giutamate and aspartate; serine and threonine; glutamine and asparagine; and valine, 
leucine, and isoleucine. Changes which are not expected to be advantageous may also be used i f 
these resulted in functional fusion proteins. 

All of the compositions and/or methods disclosed and claimed herein can be made and 
executed without undue experimentation in light of the present disclosure. While the 
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compositions and methods of this invention have been described in terms of preferred 
embodiments, it will be apparent to those of skill in the art that variations may be applied to the 
compositions and/or methods and in the steps or in the sequence of steps of the methods 
described herein without departing from the concept, spirit and scope of the invention. More 
specifically, it will be apparent that certain agents which are both chemically and physiologically 
related may be substituted for the agents described herein while the same or similar results would 
be achieved. All such similar substitutes and modifications apparent to those skilled in the art 
are deemed to he wi thin the spirit, scope and concept of the invention. 
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TABLE 10: RESTRICTION SITES FROM FIG. 3 
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TABLE 11: RESTRICTION SITES FROM FIG, 4 
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1 Noil 


""""" 678 


Dral 


5929 


\ Xhol 


687 


Dra! 


5948 


| BsaA.l 


1497 


BsaAl 


7144 


1 Rsrli 


1709 


Dral 


7626 


] Xhol 


2157 


Dral 


7703 


j Dral 


2804 


BglSi 


8389 


L,aAl 


4924 


1 oR, 


84 A 



TABLE 52: RESTRICTION SITES FROM FIG. 5 





i i i sni 


ENZYME 


CUf SIT E 


BsaAl 


f— hi 


Srfl 


. - 1 ■ 


* Noti 


878 


Notl 


2244 


BgM 


1541 


Dral 




EcoR! 


1555 


Dral 


338? 


Smal 


3573 


Drai 


4079 


•"Si,:! 


224d 







WO 00/52183 



PCT/US(WD5931 
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TABLE 13: RESTRICTION SITES FROM FIG. 6 



i «''Y ? ! 


CUT SITE 


ENZ^ ME 


r; 1 


BsaA! 


411 


Smal 


: 


Not! 


878 


Srfl 


3811 


1 BglU 


154? 


Not! 


38 1 5 i 


BsaAl 


2185 


Drai 


4939 


EcoRl 


3094 


Oral 


4958 


EcoRI 


3126 


Oral 


5650 


Smal 


3144 







TABLE 14; RESTRICTION SITES FROM FIG, 7 



ENZYME 


CtJ'i SITE 


ENZYME 


CUT SITE 


BsaAl 


411 


Smal 


3101 


Not! 


878 


Smal 


3768 


BgUI 


1541 


Srrl 


3768 


BsaAl 


2019 


Not; 


3772 


SMI 


2 no 


Oral 


4896 


BsaAl 


2523 


Drai 


4915 


Sbfl 


2789 


Oral 


5607 


EcoRl 


3083 







TABLE 15: RESTRICTION SITES FROM FIG. 8 



r-/j vi _ L i ' : sin- 


ENZYME j CUT SHE. 


j BsaAl 


43 1 


Smal 


3320 


} Not! 


878 


Srfl 


3320 1 


| Bgli! 


554! 


Not; 


3324 1 


Sfil 


2259 


Dral 


4448 | 


EcoRl 


2603 


Drai | 


4467™ ! 


EcoRI 


2635 


Dral 


5159 | 


Smal 


2 <^ 








WO 00/52183 



PCT/USW05931 



TABLE 16: RESTRICTION SITES FROM FIG. 9 



| ENZYME 


(ill S IF 


ENZYME 


c\n sin. 


BsaAl 


411 


Smal 


4029 


Not! 


878 


Srfl 


4029 


Bgfli 


E54I 


Not! 


"' 4033 


J BsaAl 


3070 


Dial 


5157 1 


EcoRl 


3331 


Drai 


5176 1 


BsaAi 


3183 


Dial 


5868 | 



s 

TABLE 17: RESTRICTION SITES FROM FIG. K) 



"1 \ /\ \\. 


' CUT Si i F 


ENZYME 


CUT SITE 


ENZYME 


i' 'Mil 


jlNoii 


693 


I Bgill 


3834 


"' !•• pFH 


11793 


1 HincMI 


04 


Hindlll 


3820 


Sphl 


32986 


1 EcoRV 


1241 


Hindlll 


3832 


Hindili 


13343 i 


\ Bglil 


1 356 


Sphl 


4136 


EcoRV 


13677 ! 


) Hindlll 


I*® — 


BspHl 


4138 


Btdll 


13792 ! 


\ Hindffi 


1374 


Ncol 


5005 


Sphl 


13971 i 


Sphl 


1678 


FcoR: 


... ... 


[ Sphl 


34061 


| Sfil 


2US 


Smai 


5374 


Ncol. 


14066 


i Ncol 


2166 


BamHJ 


5380 


EcoRV 


14277 


\ EcoRl 


2462 


Smal 


6041 


Ncol 


14321 


i Smal 


2480 


Noli 


6045 


BgOl 


14648 


! BamHl 


2486 


Xfaol 




Sex A' 


15269 ~~| 


[ Smal 


314? 






"' " Sfil 




•"' Not! 


3151 






EcoRl 


15946 _| 


Hindi! 


3162 


Xhol 


7522 


BamHl 


15904 "J 


EcoRV 


3699 


BspHl , 


11 293 




,,,,, I 



WO 00/52183 
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TABLE 18: RESTRICTION SITES FROM FIG. 1 1 



j! ENZYME 


1 n ! s ; i i 




I CI IT SITE 


J:JNZ. it Mb 


CI r SITE 




1 693 


Bglil 


f HfS 


BspH! 




i Hindill 


jLT 







Bspfll 


j 1 1824 


f EcoRV 






i 45fe2 


Sphl 


j 13017 ™ 


I BgUt 


! 1356 


Ncol 


4567 


Hindll 


| 13174 




j 1535 


Sfil 


501! 


EcoRV 




\ Sphl 


j 1625 


Ncol 


5059 


Bgiil 




j Ncol 


| 1630 


EcoRl 


5355 


Sphl 


14002 




2508 


EcoRl 


! 5387 


Sphi 


- 14092" j 




2909 


Smal 


5405 


Ncol 


14097 


1 FcoRi 


294 5 


1 BamHi 


54n ... 


EcoRV 


14308 




2959 


Smal 


6072 


Ncol 


14352 


BamHi 


2965 


Not! 


6076 


Bglil 


14679 




3626 


Xhol 


6085 


SexAl 


15300 i 


Not! 


3650 


^ Sphl 


6994 


Sfil 


j™™ — | 

15770 | 


I Hindiil 


3641 


Ncol 


7021 


EcoRl 


15977 j 


| EcoRV 


4178 


Xhol 


7553 


BamHi 


159951 


[JNZYME™ 


TABLE 19: RESTRICTION SITES FROM FIG. 12 




CUT SITE 


ENZYME 




i \AYMl 


CUT SITE 


1 NotT" 


69 j 


EcoRl 




Dral 


lU: 9 


I Bglil 


1356 


LcoRJ 


5344 


BsaAl 




BsaAl 


1834 


Smal 


5362 


Dial 




Sbi! 


1965 


Smal 


6029 


Deal 


13094 j 


BsaAl 


2338 


Srfl \ 


6029 ! 


Bgiil 


"13780 j 


Sbfl 


2604 


Not! ! 


6033 j 


Rsrll 


14338 1 


> u>K; i 


2898 


Xhol j 


6042 j 


Bglil j 


14636 | 


1 Sma! 


2916 


BsaAl 


6850 


Ast! j 


v>m 1 


Smal 


3583 


Rsrll j 


7062 


Se\A3 1 




[ Srfl 


3583 


Xhol { 


7510 


BsaAl 


35273 I 


Not; ~T 


3587 


Oral j 


8157 


Sfil ! 


15"27 1 


Bgiil j 


4250 


BsaAl | 


10315 j 


Sbfl ~J 


15875 J 


Sfil 


4968 


Oral 


11320 J 


EcoRl j 


15934 \ 



WO 60/52183 



PCT/US<KW»593I 



TABLE 20; RESTRICTION SITES FROM FIG. 13 



f ENZYME 


CUT SITE 


ENZYME 


ci sh 


1 \/>\il 




f Not! 


693 




^1 — 


BspFI 


11365 


I Hindi!! 


704 


Sphl 




Bsp.H! 


11865; : 


J EcoRV 


1241 






Sphl 


13058 , 


Belli 


1356 


Ncol 


4608 


Hind)H 


13215 I 


Sphl 


1535 


Siil 


5052 


EcoRV 


13749 1 


Sphl 


1625 


Nco'l 


5100 


Bglll 


13864 | 


Ncoi 


2497 


EcoRi 


5396 


Sphl 


14043 


HindUl 


2938 


EcoRI 


5428 


Sphi 


14133 


EcoRV 


2946 


Smal 


5446 


Ncol 


I413S| 


EcoRI 


2950 


BamHi 


5452 


EcoRV 


14349 


| EcoRi 


2982 


Smal 


6113 




j Smal 


3900 


Noti 


6117 


Bglil 


14720 


BamHi 


3606 


Xhoi 


6126 


SexAi 


15343 


I Smai 






7035 


Sftl 


158U 


Not.! 


3671 


Ncol 


7062 


EcoRi 


160IS 


1 • ndlll | 3682 


Xhoi 


7594 


BamHl 


16036 


EcoRV j 4219 











WO 00/52183 



PCT/USOO/05931 



TABLE 21 : RESTRICTION SITES FROM FIG. 14 



j ENZYME 


! CUT SITE 


i ENZYME 




CUT hilt 


\\>ri 


I 693 


^-BT 


T 5702| Xhol 


i 919? 


Mindlll 


704 


— — 


| 6053 


| SphS 


! 10106 1 


1 EcoRV 


j 24| 


'• ma -. 


6071 


Ncol 


j 10133 ij 


1 Belli 


. .. 


BamHl 


6077 


Xhol 


< 10665 \ 


1 Sphi 


.2^ 


Smal 


6738 


BspHl 


"14436 f 


1 Snhl 





— ul j 


6742 


BspHi 




j; Nco! 




Hindi li 


6753 


Sphi 


16129 | 


| hcoRi 


2946 


EcoRV 


7290 


Hindil 


16286 1 


L SnaB5 


2998 


BgH 


7405 


EcoRV 


16820 


f Ned 


3032 


Sphi 


7x84 


Bgill 


16935 


I EcoRV 


3179 


Sphi 


7674 


_ Sphi 


17114 1 


! BamHl 


3183 




7679 


SpW 


17204 |i 


! Smal 


3844 


SfU 


8123 


Nccl 


17209 J 


Not! 


•S4S 


Ncol 


81?] 


EcoRV 


17420 j 


Hindlli 


3859 


EcoRI 


8467 


Ncol 


17464 j 


EcoRV 


4396 


EeoRi 


8499 


Bgill 


17791 j 


l~ Bglli 


4511 


Smal 


8517 4 


S .xAi 


38432 | 


Hindi!] 


4517 


BamHl 


8523 


Sfil 


18882 1 


Bmdlll 


452« 


Smal 


9184 


EcoRI 


19089 | 


Sphi i 


4833 


Not! 


9188 


BamHl 


1910? 1 


BspHl | 


, | 








™> — J 



WO 08/52183 



PCT/US04MD5931 
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TABLE 22: RESTRICTION SITES FROM FIG. 1 5 



f \/ \ Mi 


CUT SITE 


Xli^Zj I JVIi!. 


CUT SITF 




CUT SITE 


Not! 


IkI 





^22: 


Smai 


1 1672 


1 Hindli 1 




J^j™. 


7003 


B ! ill il 


11678 


i EcoRV 


— T5n — 

— ~4|=1 — 


Mm dill 


7014 


Smai 


12: >9 


Bgll] 


J2£. — 


^ — 


7551 


Notl 


1 2343 i 


— |£~i— 




Bgiil 


7666 


Xhol 


.12352 


\ — Jll — 


1 625 


Hindill 


7672 


Sphi 


13261 


\ — .i~ — 


l^H 


Hindi!! 


7684 


Ncol 


13288 


; 


— 


Sphi 


7988 


Xhol 


13820 


~L- — 


— 


BspHI 


7990 


BspHI 


1759! 




h— ....™.t 


Ncol 


8857 


BspHI 


IS09? 


F k 


;> 1 79 


EcoRl 


9208 


Sphi 


19284 


j BamHI 


3 1 83 




9226 


liindll! 




Smai 


:S44 


BamHI 


9232 


EcoRV 


19975 


Not! ~i 


•848 


Smai 


9893 


Bgll] 


20090 


Mmdlil 


3859 


Not! 


9897 


Sphi 


20269 | 


* EcoRV 


4396 


Hindi!! 


9908 


Sphi 


20359 


Bgiil 


451 1 


EcoRV 


10445 ' 


Ncol 


20364 


Sphi 


4690 


Bgll! 


10560 


EcoRV 


20575 


Sphi 


4780 


Sphi 


10739 


Ncol 


20619 


Ncol 


4785 


Sphi 


10829 


Bgiil 


20946 - , 


EcoRl 


6101 


Ncol 


10834 f 


Sex A! 


21567 j 


SnaBi 


6153 


SfiS 


11278 


SfiJ 


22037 


Ncol 


6187 


Ncol 


11326 


EcoRl 


22244 


EcoRV 


6334 


EcoRl 


11622 


BamHI 


2226? } 


Bam HI 


6338 


EcoRl j 


11654 | 







WOW52183 



PCT/US0O/8S93! 



-67- 



TABEE 23; RESTRICTION SITES FROM FIG, 16 





CUT SITE 


ENZYME 


CUT SITE 


ENZYME 


cut Mir 


j Not! 


678 


Drai 


774 


j BsaAl 


g . .; 


Spel 


685 


Swal 


774 


BsaAl 


I R :t >1 


BsaAl 


693 


Xhol 


779 


Dra! 


8394 s 


S.i::Di 


698 


Drai 


3426 


Bglii 


S583 


! Rsrli 


705 


BsaAl 


5546 


Rsrll 


9140 


j Sex A! 


7.11 


Dra! 


4551 


Bglii 


9438 


! Pad 




Drai 


4570 


Asci 


9993 


Sgfl 


■'30 


BsaAl 


5766 


SexAl 


10059 


Sii! 


74! 


Drai 


6248 


BsaAl 


30075 


Ascl 


748 


Drai 


6325 


sm 


10529 * 


Sbil 


760 


Drai 


6424 


Sbfl 


10677 


Smal 


766 


Pad 


7426 


EcoRl 


10736 


Srtt 


766 


Di i 


7887 







TABLE 24: RESTRICTION SITES FROM FIG. 17 



j ENZYME 


t : ; sue 


ENZYME 


CUT SITE 


ENZYME 


CUT SITE 


1 Notl 


678 


Smal 


i 766 


Drai 


624? 


| Spel 


685 


Srfl 


766 


Drai 


6325 ~* 


BsaAi 


693 


Drai 


774 


Drai 


6424 


j SanDl 


698 


Swal 


774 


Pad 


7426 


Rsril 


705 


Xhol 


779 


j Dra! 


! 7887 


SexAl 


713 


Drai 


1426 


Bs tA: 


8144 


Pad 


722 


BsaAl 


3546 


BsaAl 


8364 


Sgf! 


730 


Drai 


455? 


Di 


8394 


Sill 


741 


Drai 


4570 


Bglii 


8582 ! 


Ascl 


748 


BsaAi 


5766 


EcoRl 


8606 i 


v Shfl , 


760 











WO 00/52 1 83 
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TABLE 25: RESTRICTION SITES FROM FIG. 18 



j ENZYME 


f nj sni 


ENZYME 


COT SITE 


P BsaAF " 


411 


EcoRI 




j Not] 


878 


Smal 


3141 


Drai 


951 


Smal 


3808 


Pad 


1953 


Sr.fl 


3808 


Drai 


2414 


Nofl 


*" 3812 > 


BsaAi 


2671 


Drai 


4936 j 


j BsaAi 


2691 


Drai 


4955 j 


i Drai 


2921 


Drai 


5647 f 


i Bgiil 


3109 




1 



TABLE 26: RESTRICTION SITES FROM i U, 19 



| ENZYME 


VI ■ I M I ! 


ENZYME 


k CUTSIT£ J 


1 BsaAi 


411 




"53 | 


Notl 


878 


FcoRi 


4662 i 


Drai 


951 


Smal 


4680 ; 


Pad 


1953 


Smal 


5347 


' Drai' 


2414 


Sril 




BsaAi 


2671 


Notl 


5351 


BsaAi 


2691 


Drai 


6475 | 


Draf 


2921 


Drai 


6494 j 


._ m. 


3109 


Dra! 


7 " 6 j 



TABLE 27: RESTRICTION SITES FROM FIG. 20 



| ENZYME 
| Noti 


CUT SITE 
... x7 -- 


* enzymTTo I - 
Hlli ~ 


HindHi 


L 889 


EcoRV 


46^0 


| Sphi 


H 1041 


EcoRI 


* 4703 


| "Pad 


1953 


Smal 


~ 4721 


BspHl 


2613 




4727 


BspHi 


2736 


Smal 


5388 


Bgiii 


3109 


J^ojJ_j 




Sphi 


3288 


BspHi 


6477 


1 Sphi 


3378 


BspHl 


7485 




4250 


BspHi 


7590 
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TABLE28: RESTRICTION SITES FROM FIG. 21 



J N/YMl 


CUT SITE 


ENZYME 


CUT SITE 




f tT errr - 
LA J I Mib 1 


\ hoi 




Dral 




SanDI 


4317 


Dral 


280 


Paci 





Rsrii 


4324 j 




280 


Dral 







4330 


Smal 


288 


BsaAl 


2165 


Paci 


4341 


| SrO 


288 


BsaAl 


2185 


Sell 


4349 


1 Sbfl 


298 


Dral 


2415 


Sfil 


4361 


| Ascl 


302 


Spel 


2609 


Asei 


4368 


Sfil 


316 


Smal 


| 2867 


Sbfl 


4380 




326 


Sill 


3315 


Smal 


4386 


i p ac j 




EcoRl 


3608 


Srfl 


4386 J 


i SexAl 


338 


Smal 


3626 


Drai 


4394 | 


Rsrii 


346 


Sma 


4293 


Swal 


4394 


SanDI 


353 


Srfl 


4293 


Dral 


5388 


BsaAl 


361 


Notl 


4297 


Dral 


5407 


SpeJ 


365 


Spel 


4,0 ; 


Drai 


6099 


Not! 


372 


BsaAl 


4312 








TABLE 29: RESTRICTION SITES FROM FIG. 22 






1 N/A \If 


OUT SITE 


ENZYME 


CUT SITE 






BsaAl 


41 1 | 


BsaAl 


4638 






Notl 


878 


EcoRl 


4699 






Drai 




BsaA! 


4751 






Pad 


1953 


Sma! 


5597 






Oral 


2414 


Srfl 


5597 




| BsaAl 


2671 


Not; 


5601 






BsaAl 


2691 


Dral 


6725 






Dral 


2921 


Oral 


6744 






Bglli 


3109 


Dial 


7436 





WO 00/52183 
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TABLE 30: RESTRICTION SITES FROM FIG. 23 



s i\Z i'Mi 


( ( J SI ! f 


ENZYME 




ts>i£ Y Mb 


CI H" SiTE 


Not) 




— fr^TF — 


7,88 


Not! 


1 1596 


Hindill 


689 


— -J*t^ — 




BspH I 


15339 j 


j Sphl 


841 


— |E*M — 




BspH! 


15839 


Pad 


i 753 




7646 


Sphl 


! 7032 


BspHi 


rli! 


Sill 


8094 


Hindlli 


17189 ' 




— 


Ncol 


8142 


Sphi 


17341 


Bglll 






8387 


Pad 


18253 


Split 


3088 


Smai 


8405 


BspHi 


38913 1 


Sphi 


3178 


BamHi 


8411 


BspH! 


19036 


Ncol 


3183 


BamHI 


9358 


Bgiii 


19409 


I Apal 


406? 


EcoRI 


9376 


Sphi 


19588 ! 


EcoRl 


4462 


BspHi 


10162 


Sphi 


19678 


Smal 


44 SO 


Ncol 


10435 


Ncol 


19683 


BamHI 


-I486 


BamHI 


10546 


EcoRV 


19894 


Sinai 


5147 


Ncol ' 




Ncol : 


19938 


| Nod 


5151 


Siil 


10569 


Bglll 


20265 ! 


Hindll? 


5162 


Sphl | 


10757 


SexAi 


20886 


>fihl 


5314 


Bglll ! 


10980 


Sill 


21356 


Pad 


6226 


EeoRl ! 


11052 


EecRi 


21563 


| BspH! 


6886 


EcoRI 


il455 


BamHI 


21581 


f BspHi 


'009 


Hindlli 


i 1585 







WO 00/53183 



PCT/US00/85931 
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TABLE 31: RESTRICTION SITES FROM FIG, 24 



\ \ \\\ 


CUT SITE 


hh/ Mb 


CUT SITE 


r.i\Z t Mr. 


CUT SITE 


| tNOU 


— 12| — . 


BspHi 


7050 


HiiKiiil 


H626 ~ 


1 1 ndi 





Spel 


7429 


Not! 


11637 


Sphl 




EcoRV 


7454 


BspHi 


15380 


Pad 


175 3 


Sphl 


7614 


BspHl 


1.5880 ~1 


■ BspHl 





Sma] 


7687 


Sphi 


17073 




r±?| 


Sfii 


8135 


HindlU 


17230 


Bs;lll 




Ncol 


8183 


Sphi 




Sohl 


3088 


EcoRI 


8428 


Pad 


18294 


Sphl 


3178 


Smai ! 8446 


BspHi 


18954 


Ncoi 


4050 


BamHJ 


8452 


BspHl 


19077 


HindKl 


4491 


BamHl 


9399 


Bgiil 


19450 


EcoRV 


4499 


EcoRI 


>4!7 


SpiU 


19629 


EcoRf 


4503 


BspHi 


10203 


Sphi 


19719 


Smai 


4521 


Ncol 1 10476 


Ncoi 


19724 


BamHl 


4527 


BamHl 


10587 


EcoRV 


19935 


Smai 


5188 


Ncoi 


19590 


Ncol 


19979 


Nod 


5192 


Sfil 


10610 


Bgiii 


20306 


HindUl 


5203 


Sphl 


10798 


SexAl 


20927 


; sphi 


5355 


Bgiii t 


1 1021 


Sfii 


21397 


| f aci 


6267 


i V.>R{ 


1 1 092 


EcoRi j 


21604 j 


BspHl 


6927 


EcoRI 


1 )l»h 


21622 | 



WO 00/52183 
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TABLE 32; RESTRICTION SITES FROM FIG. 25 





CUT SITE 


ENZYM E 


(I I M T i 


j -. \ ?i 


1 !~1 ?T CTTL 

! tUi Mit 


i Not! 


678 


EcoRl 


90 Ft 


— ""iSrl 




| Flindlii 


689 


EcoRV 


9246 


Bgill 


1 5744 


I Sphl 


841 


BamHi 


9250 


EcoR! 


15816 | 


1 Fad 


1753 


Smai 


991 1 


bco5 3.i 


16239 ! 


ij BspHi 


24 1 3 


Not! 


993 5 


- 


"1 6349 j 


1 BspHi 


2536 


Hindlll 


9926 


Not! 


1 6360 


Bgllt 


2909 


Sphl 


100 7 8 


BspHi 


20103 


Sphl 


3088 


Pad 


10990 


BspHi 


20603 


I Sphl 


3178 


BspHl 


H650 


Sphl 


21796 j 


Ncol 


4050 


BspHi 


11773 


Hindlll 


21953 


Hindlll 


TT^V 


Spel 


12152 


Sphi 


22105 


EcoRV 




EcoRV 
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CLAIMS: 

1 - Ati isolated nucleic acid segment comprising: 

a first nucleic acid sequence encoding a poiyhydroxyalkanoate synthase protein; 
a second nucleic acid sequence encoding a jj-ketoaeyl reductase protein; and 
a third nucleic acid sequence encoding a pVketothioiase protein. 

2. The isolated nucleic acid segment of claim 1 , further comprising a fourth nucleic acid 
sequence encoding a threonine deaminase protein. 

3. The isolated nucleic acid segment of claim I , further comprising a fourth nucleic acid 
sequence encoding a deregulated threonine deaminase protein. 

4. The isolated nucleic acid segment of claim L wherein: 

the fust nucleic acid sequence further encodes a chioropiast transit peptide; 

:he s Jum i nucleic acid sequence further encodes a chioropiast transit peptide; and 

the third nucleic acid sequence further encodes a chloroplast transit peptide. 

5. A recombinant vector comprising operative^ linked in the 5' to 3* direction: 

a promoter that directs transcription of a first nucleic acid sequence, a second nucleic acid 

sequence, and a third nucleic acid sequence; 
a first nucleic acid sequence; 
a second nucleic acid sequence; 
a third nucleic acid sequence; 
a 3" transcription terminator; and 
a V polyadeoy Union signal sequence; 
wherein: 

the first nucleic acid sequence, second nucleic acid sequence, and third nucleic 
acid sequence encode different proteins; and 

the first nucleic acid sequence, second nucleic acid sequence, and third nucleic 
acid sequence are independently selected from the group consisting of a 
nucleic acid sequence encoding s polyhydroxyalkanoare synthase protein, 
a nucleic acid sequence encoding a B-ketoacyl reductase protein, and a 
nucleic acid sequence encoding a p-ketothiolase protein. 

S. The recombinant vector of claim 5, wherein the promoter directs transcription of the first 
nucleic acid sequence, the second nucleic acid sequence, and the third nucleic acid 
sequence in plants. 

7. The recombinant vector of claim 5, wherein the promoter is a viral promoter. 

i. The recombinant vector of claim 5, wherein the promoter is a CMV 35 S promoter an 
enhanced CMV 35S promoter, or an FMV 35S promoter. 

K The recombinant vector of claim 5, wherein the promoter is an enhanced CMV 35S 
promoter. 
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The recombinant vector of claim 5, wherein the promoter is a tissue specific promoter. 

The recombinant vector of claim 5, wherein the promoter is a Lesquerella hydroxylase 
promoter or a 7S eongiyemin promoter. 

The recombinant vector of claim 5, wherein the promoter is a Lesquerella hydroxylase 
promoter, 

"Die recombinant vector of claim S. wherein: 

the first nucleic acid sequence further encodes a chloroplast transit peptide: 

the second nucleic acid sequence further encodes a chloroplast transit peptide: and 

the third nucleic acid sequence further encodes a chloroplast transit peptide. 

A recombinant vector comprising: 

a first element comprising operatively linked in the 5' to 3' direction: 

a first promoter that directs transcription of a first nucleic acid sequence; 

a first nucleic acid sequence encoding a polyhydroxyalkanoate synthase protein; 

a first 3' transcription terminator; and 

a first 3' polyadenylation signal sequence; 
a second element comprising operatively linked in the 5 s to V direction: 

a second promoter that directs transcription of a second nucleic acid sequence; 

a second nucleic acid sequence encoding a p-ketoacyl reductase protein; 

a second 3' transcription terminator; and 

- eeond 3 polyadenyl ton ignal sequence; and 
a third element comprising operatively linked in the 5' to 3" direction: 

a third promoter that directs transcription of a third nueieic acid s 

8 *ird nucleic acid sequence encoding a p-ketothiolasc protein; 

a third 3' transcription terminator; and 

a third 3' polyadenylation signal sequence. 

The recombinant vector of claim 14, wherein the fVkeiomiolase protein: 
catalyzes the condensation of two molecules ofaeetyl-CoA to produce aeetoacetyl-CoA; 
and 

catalyzes the condensation of acetyi-CoA and propiom I-CoA to produce [J-ketovaiervl- 
CoA. 

The recombinant vector of claim 14, wherein the pV-fcetoaeyi reductase protein: 
catalyzes the reduction of aeetoacetyl-Co A to P-hydroxybutyryl-CoA; and 
catalyzes the reduction of [Vketovaleryl-CoA to fMivdroxvvaiervh-CoA. 



.yaikanoate synthase protein 



The recombinant vector of claim 14. wherein the pol 
is selected from the group consisting of: 
a polyhydroxyalkanoate synthase protein that catalyzes the incorporation of> 
hydroxybntyry]-CoA into P(3HB) polymer: and 
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a polyhydroxyalkanoate synthase protein Chat catalyzes the incorporation of [3- 
hydroxybutyryl-CoA and ^-bydroxyvaleryl-CoA into P(3HB-co-3HV) 
copolymer. 

The recombinant vector of claim 14, wherein: 

the Mtetothiolase protein comprises a transit peptide sequence that directs transport of 

the jJ-keknhinia.se. protein to the plastid; 
the P-keioacy! reductase protein comprises a transit peptide sequence that directs 

transport of the fj-ketoaeyi reductase protein to die plastid; and 
the polyhydroxyalkanoate synthase protein comprises a transit peptide sequence that 

directs transport of the polyhydroxyalkanoate synthase protein to the plastid. 

The recombinant vector of claim 14, fitrther comprising a nucleic acid sequence encoding 
a threonine deaminase protein. 

The recombinant vector of claim 14, further comprising a nucleic acid sequence encoding 
a deregulated threonine deaminase protein. 

The recombinant vector of claim 14, wherein: 

the first promoter directs transcription of the first nucleic acid sequence in plants; 
the second promoter directs transcription of the second nucleic acid sequence in plains; 
and 

the third promoter directs transcription of the third nucleic acid sequence in plants. 

The recombinant vector of claim 14, wherein the first promoter, second promoter, and 
third promoter are viral promoters. 

The recombinant vector of claim 14, wherein: 

the first promoter is a CMV 35S promoter, an enhanced CMV 35S promoter, or an FMV 
35S promoter; 

the second promoter is a CMV 35S promoter, an enhanced CMV 35S promoter, or an 

FiViV 35S promoter: and 
the third promoter is a CMV 35S promoter, an enhanced CMV 35S promoter, or an FMV 

35S promoter. 

The recombinant vector of claim 14, wherein: 

the first promoter is an enhanced CMV 35S promoter; 

the second promoter is an enhanced CMV 35S promoter; and 

the third promoter is an enhanced CMV 35S promoter. 

The recombinant vector of claim 14, wherein: 
the fust promoter is a tissue specific promoter; 
the second promoter is a tissue specific promoter; and 
the third promoter is a tissue specific promoter. 

The recombinant vector of claim 14, wherein; 
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the first promoter is a LesquereUa hydroxylase promoter or a 78 congiycinin promoter; 
the second promote! \ • o ( /?,< u 'u fo dto^ vi<>t pu motcr or a "S Jon fi^ n m 
promoter; and 

the third promoter is a Lesquereikt hydroxylase promoter or a 7S congiycinin promoter. 

The recombinant vector of claim 14, wherein; 

the first promoter is a Lesquerella hydroxylase promoter; 

the second promoter is a Lesquerelia hydroxylase promoter; and 

file third promoter is a Lesqmrdla hydroxylase promoter. 

The recombinant vector of claim 14, wherein; 

the first nucleic acid sequence farther encodes a ehbroplast transit peptide; 

the second nucleic acid sequence further encodes a ehkm p i md 

the third nucleic acid sequence further encodes a chioroplast transit peptide. 

A transformed host cell comprising a recombinant vector, wherein the recombinant 
vector comprises: 

a first element comprising operativrfy linked in the 5' to 3' direction: 

a first promoter that directs transcription of a first nucleic acid sequence; 

a first nucleic acid sequence encoding a poiyhydroxyalkanoate synthase protein; 

a first 3 ' transcription terminator; and 

a first V polyadenyiation signal sequence; 
a second element comprising operativeiy linked in the 5' to V direction; 

a second promoter that directs transcription of a second nucleic acid sequence; 

a second nucleic acid sequence encoding a p-ketoacy] reductase protein; 

a second 3 ! transcription terminator; and 

a second 3 polyadei ioi < ma! • uence and 
a third element comprising operativeiy linked in the 5' to 3 J direction: 

a third promoter that directs transcription of a third nucleic acid sequence; 

a third nucleic acid sequence encoding a {5-ketothiolase protein; 

a third r Transcription terminator; and 

a third 3' polyadenyiation signal sequence. 

The transformed host ceil of claim 29, wherein the transformed host eel! is a bacterial 
cell 

The transformed host cell of claim 29, wherein the transformed host cel.; is a fungal cell. 
The transformed host ceil of claim 29, wherein the transformed host cell is a plant cell. 
The transformed host cell of claim 29, wherein; 

the first nucleic acid sequence further encodes a chioroplast transit peptide; 

cos-id nucleic acid sequence further encodes a chioroplast transit peptide; an d 
the third nucleic acid sequence forther encodes a chioroplast transit peptide. 
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34. A transformed host cell comprising: 

a first element comprising operatively linked in the 5" to 3' direction: 

a first promoter that directs transcription of a first nucleic acid sequence; 

a first nucleic acid sequence encoding a pol yhydroxyalkanoate synthase protein; 

a first 3' transcription terminator; and 

a first 3 ' polyadenylation signal sequence; 
a second element comprising operatively linked in the 5' to 3' direction: 

a second promoter that directs transcription of a second nucleic acid sequence; 

a second nucleic acid sequence encoding a p-ketoacyl reductase protein; 

a second 3' transcription terminator; and 

a second 3' polyadenylation signal sequence; and 
a third element comprising operatively linked in the 5* to 3 5 direction: 

a thi rd promoter that directs transcription of a third nucleic acid sequence; 

a third nucleic acid sequence encoding a [Jdcetothiolase protein; 

a third 3' transcription terminator; and 

a third 3" polyadenylation signal sequence; 
wherein the first element, second dement, and third element are cointegrated between a 

single left Ti border sequence and a single right Ti border sequence. 

35. The transformed host cell of claim 34, wherein the. transformed host cell is a fungal cell 

36. The transformed host cell of claim 34, wherein the transformed host cell is a plant cell 

37. The transformed host cell of claim 34, wherein the transformed host cell is a tobacco, 
wheat, potato, Arabidopsis > com, soybean, canoa. oil M.eo m;v ,uuiW t U<,\ i mint, 
sugarcane, switehgrass, or alfalfa cell. 

38. fhe transit ( oi i urn 4 wherei 

the first nucleic acid sequence further encodes a chloroplast transit peptide; 

the second nucleic acid sequence further encodes a chloroplast transit peptide; and 

the third nucleic acid sequence further encodes a chloroplast transit peptide. 
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A transformed plant comprising: 

a first element comprising operative!}' linked m the 5' to 3' direction: 

a first promoter that directs transcription of a first nucleic acid sequence; 

a first nucleic acid sequence encoding a polyhydroxyaikanoate synthase protein; 

a lust transcri it < era r <ter.and 

a ftrst 3' polyadenylation signal sequence; 
a second element comprising operative!}' linked in the 5' to 3' direction: 

a second promoter that directs transcription of a second nucleic acid sequence; 

a second nucleic acid sequence encoding a p-ketoacyl reductase protein; 

a second 3 ' transcription terminator; and 

a second 3' polyadenylation signal sequence, and 
a third element comprising operatively linked in the 5' to 3' direction: 

a third promoter that directs transcription of a third nucleic acid sequence; 

a third nucleic acid sequence encoding a {S-ketothiolasc protein; 

a third 3 ' transcription terminator; and' 

a third 3' polyadenylation sigiiai sequence; 
wherein the ftrst element, second element, and third element are eomtegrated between a 

single left Ti border sequence and a single right Ti border sequence. 

The transformed plant of claim 39, wherein the transformed plant is a tobacco, wheat, 
potato, Arahidopsis, corn, soybean, canola, oil seed rape, sunflower, flax, peanut, 
sugarcane, switchgrass, or alfalfa plant 

The transformed plant of claim 39, wherein: 

the first nucleic acid sequence further encodes a chloroplast transit peptide; 

the second nucleic acid sequence further encodes a chloroplast transit peptide; and 

the third nucleic acid sequence further encodes a chioronlast transit peptide. 
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A method of preparing transformed host cells, the method comprising: 
selecting a host cell; 

transforming the selected host cell with a recombinant vector comprising: 

a first element comprising operative!}' linked in the 5' to 3' direction: 

a first promoter that directs transcription of the first nucleic acid sequence; 
a first nucleic acid sequence encoding a poiyhydroxyalkanoate synthase 
protein; 

a first 3' transcription terminator; and 
a first 3' poiyadenylation signal sequence: 
a second element comprising operatively linked in the 5' to 3" direction: 

a second promoter that directs transcription of the second nucleic acid 
sequence; 

a second nucleic acid sequence encoding a f3-ketoacyi reductase protein; 
a second :V transcription terminator; and 
a second 3' poiyadenylation signal sequence; and 
a third element comprising operatively linked in the 5' to 3' direction: 

a third promoter that directs transcription of the third nucleic acid 
sequence; 

a third nucleic acid sequence encoding a P-ketothiolase protein; 
a third 3' transcription terminator; and 
a third 3 * polyadenylation signal sequence; and 
obtaining transformed host cells; wherein the transformed host cells produce 
poiyhydroxyalkanoate polymer. 

A method of preparing transformed host ceils, the method comprising: 
selecting a host cell: 

transforming the selected host cell with a recombinant vector comprising operatively 
linked in the 5' to V direction: 

a promoter that directs transcription of a first nucleic acid sequence, second 
nucleic acid sequence, and third nucleic acid sequence; 

a lust nucleic acid sequence; 

a second nucleic acid sequence; 

a third nucleic acid sequence; 

a 3 ! transcription terminator; and 

a 3' po.h ' i signal sequence; and 

obtaining transformed host ceils; wherein: 

the first nucleic acid sequence, second nucleic acid sequence, and third nucleic 
acid sequence encode different proteins; 

the first nucleic acid sequence, second nucleic acid sequence, and third nucleic 
acid sequence are independently selected from the group consisting of a 
nucleic acid sequence encoding a poiyhydroxyalkanoate synthase protein, 
a nucleic acid sequence encoding a p-ketoacyl reductase protein, and a 
nucleic acid sequence encoding a p-ketothiolase protein; and 

the transformed host cells produce poiyhydroxyalkanoate polymer. 

A method of preparing transformed plan ts, the method comprising: 
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selccting a host plan; eel!; 

transforming the selected host plant ceil with a recombinant vector comprising: 
a first element comprising operatively linked in the 5 J to ?y direction: 

a first promoter that directs transcription of a first nuclei c acid sequence : 
a first nucleic acid sequence encoding a polyhydroxyalkanoate synthase 
protein; 

a first y transcription terminator; and 
a first .V polyadenylation signal sequence; 
a second element comprising operatively linked in the 5* to 3* direction: 

a second promoter that directs transcription of a second nucleic acid 
sequence; 

a second nucleic acid sequence encoding a (3-ketoacyl reductase protein; 
a second 3' transcription terminator; and 
a second 3' polyadenylation signal sequence; and 
a third element comprising operatively linked in the 5' to 3' direction: 

a third promoter that directs transcription of a third nucleic acid sequence; 
a third nucleic acid sequence encoding a fj-ketothiolase protein; 
a third .V transcription terminator; and 
a third 3* polyadenylation signal sequence; 
obtaining transformed host plant cells; and 

regenerating the transformed host plant ceils to produce transformed plants, wherein the 
transformed plants produce polyhydroxyalkanoate polymer. 

A method of preparing transformed plants, the method comprising: 
selecting a host plant cell; 

transforming the selected host plant cell with a recombinant vector comprising 
operatively linked in the 5' to 3* direction: 

a promoter that directs transcription of a first nucleic acid seq uence, second 

nucleic acid sequence, and third nucleic acid sequence; 
a first nucleic acid sequence; 
a second nucleic acid sequence: 
a third nucleic acid sequence; 
a 3' transcription terminator: and 

obtaining transformed host plant cells; and 

regenerating the transformed host plant cells to produce transformed plants; wherein: 
the first nucleic acid sequence, second nucleic acid sequence, and third nucleic 

acid sequence encode different proteins; 
the first nucleic acid sequence, second nucleic acid sequence, and third nucleic 
acid sequence are independently selected from the group consisting of a 
nucleic acid sequence enu dm . p >ht \du walkanu tvsm 
a nucleic acid sequence encoding a 0-ketoacyl reductase protei a . and a 
nucleic acid sequence encoding a pVkerothiolase protein; and 
the transformed plants produce polyhydroxyalkanoate polymer. 

A method of producing polyhydroxyalkanoate comprising: 
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obtaining the transformed host cell of claim 29 or claim 34; 

cuituriog the transformed host cell under conditions suitable for the production of 

poly hydroxy alkanoate; and 
recovering poiyhydroxyalkanoate from the transformed host ceil. 

The method of claim 46, wherein the poiyhydroxyalkanoate is poiv(3-hydroxvbiftyrate), 
poly{4-hydroxy butyrate), or poly(3-hydroxybutyrate-co-4-hyd roxy butyrate) ." 

A method of producing poiyhydroxyalkanoate comprising: 
obtaining the transformed plant of claim 39; 

growing the transformed plant under conditions suitable for the production of 

poiyhydroxyalkanoate; and 
recovering poiyhydroxyalkanoate from the transformed plant. 

The method of claim 48, wherein the poiyhydroxyalkanoate is poly(3-hvdroxybutvmte). 
poly{3-hydroxyvalerate), or pc>ly(34iydroxybtityrate-co-3-hydioxyvaier*ate). * 
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